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(STVAbstract: The present invention relates to murine genomic regions, identified by retroviral insertional tagging of mice as being 
involved in tumor development, in particular leukemia development, as well as human homologues thereof, and to the use of these ge- 
nomic regions for the identification and development of anti-cancer drugs, such as small molecule inhibitors, antibodies, ribozymes, 
antisense molecules and RNA interference (RNAi) molecules, that are effective in reducing or eliminating the tumorigenic effects 
of genetic transformations in these genomic regions and/or eliminating the tumorigenic effects of expression products thereof. The 
invention further relates to these an ti -cancer drugs and to their use as pharmaceutical reagents for the treatment of cancer, as well as 
to pharmaceutical compositions comprising one or more of said pharmaceutical reagents and to methods for the treatment of cancer 
using said pharmaceutical compositions, in particular to methods of gene therapy. In yet further aspects, the invention relates to nu- 
cleic acids, to antibodies capable of binding specifically to murine genomic regions and to expression products thereof, to the use of 
said nucleic acids or antibodies as diagnostic reagents for the diagnosis of cancer, as well as to diagnostic compositions comprising 
one or more of said diagnostic reagents and to methods for the diagnosis of cancer using said diagnostic compositions. 
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Title: Use of murine genomic regions identified to be involved in tumor 

development for the development of anti-cancer drugs and diagnosis 
of cancer 

This application is a continuation-in-part of U.S. Patent Application Serial No. 
10/252,132 filed September 19, 2002. 

FIELD OF THE INVENTION 

5 The present invention relates to murine genomic regions, identified by 

retroviral insertional tagging of mice as being involved in tumor development, 
in particular leukemia development, as well as human homologues thereof, 
and to the use of these genomic regions for the identification and development 
of anti*cancer drugs, such as smaU molecule inhibitors, antibodies, ribozymes, 

10 antisense molecules and RNA interference (RNAi) molecules, that are effective 
in reducing or eliminating the tumorigenic effects of genetic transformations in 
these genomic regions and/or eliminating the tumorigenic eiffects of expression 
products thereof. The invention further relates to these anti-cancer drugs and 
to their use as pharmaceutical reagents for the treatment of cancer, as well as 

15 to pharmaceutical compositions comprising one or more of said pharmaceutical 
reagents and to methods for the treatment of cancer using said pharmaceutical 
compositions, in particular to methods of gene therapy. In yet further aspects, 
the invention relates to nucleic acids derived from said muxine genomic 
regions involved in timior development or fragments thereof, to antibodies 

20 raised against the esipression products of genes the sequence of which is 

comprised in said murine genomic regions or human homologues thereof, to 
antibodies raised against the expression products of genes affected by genetic 
transformations in said murine genomic regions or human homologues thereof, 
to the use of said nucleic acids or antibodies as diagnostic reagents for the 

25 diagnosis of cancer, as well as to diagnostic compositions comprising one or 
more of said diagnostic reagents and to methods for the diagnosis of cancer 
using said diagnostic compositions. 
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BACKGROUND OF THE INVENTION 

After a quarter century of rapid advances, cancer research has 
generated a rich and complex body of knowledge revealing cancer to be a 
5 disease involving dynamic changes in the genome. Cancer is thought to resxilt 
from six essential alterations in cell physiology that collectively dictate 
maUgnant growth: self-sufficiency in growth signals, insensitivity to growth- 
inhibitory (anti-growth) signals, evasion of programmed cell death (apoptosis), 
limitless replicative potential, sustained angiogenesis, and tissue invasion and 
10 metastasis. 

In general, these essential alterations are the result of mutations in 
genes involved in the control of these cellular processes. These mutations 
include deletions, point mutations, inversions and amplifications. The 
mutations result in either an aberrant level, timing, and/or location of 

15 expression of the encoded protein or a change in function of the encoded 
protein. These alterations can affect cell physiology either directly, or 
indirectly, for example via signaling cascades. 

The possibilities to identify genes that promote the transition firom a 
normal cell into a malignant cell is an essential prerequisite for the 

20 development of novel therapies for the treatment of cancer. 

One of the most common therapies for the treatment of cancer is 
chemotherapy. Herein, the patient is treated with one or more drugs that 
function as inhibitors of cellular growth and are therefore intrinsically toxic. 
Since cancer cells are among the fastest growing cells in the body, these cells 

25 are severely affected by the applied drugs. However, normal cells are also 

affected which results - besides toxicity - in very severe side-effects such as loss 
of fertility. 

Another commonly used therapy to treat cancer is radiation therapy or 
radiotherapy. Radiotherapy uses high energy rays to damage cancer cells 
30 resulting in the induction of cell cycle arrest. Cell cycle arrest will ultimately 
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result in programined cell death (apoptosis). However, also normal cells are 
irradiated and damaged. In addition, it is difficult to completely obliterate all 
tumor cells using this therapy. Importantly, very small tumors and developing 
metastases cannot be treated by radiotherapy. Moreover, irradiation can cause 
5 mutations in the cells surrounding the tumor which increases the risk of 
developing new timiors. Combinations of both chemo- and radiotherapy are 
also frequently used and a subsequent accumvdation of side-effects is observed. 

The major disadvantage of both therapies is that they do not 
discriminate between normal and timior cells. Furthermore, tumor cells have 
the tendency to become resistant to these therapies, especially to 
chemotherapy. 

Therapies directed at tumor-specific targets would increase the 
efficiency of the therapy due to i) a decrease in the chance of developing drug 
resistance, ii) a reduced toxicity since the drugs used in these tumor-specific 
therapies are appHed at much lower concentrations and iii) observed side- 
effects are reduced since only tumor cells are affected. 

The use of tumor-specific therapies is limited by the number of targets 
known. Since tumors may arise from a large niunber of different and discrete 
changes in the genome, the genot3rpe between tumor cells may vary 
considerably. The disease type that they cause may however still be classified 
as the same. This is one of the main reasons why not a single therapy exists 
that is effective in all patients with a certain type of cancer. 

Pharmacogenetics and pharmacogenomics aim at determining the 
genetic determinants liiiked to diseases. Most of the disease are multigenic 
diseases, and the identification of the genes involved therein should allow for 
the discovery of new targets and the development of new drugs. 
Pharmacogenomics also encompasses the use of specific medications according 
to the genot3^e of the patient. This shotdd lead to a dramatic improvement of 
the efficiency of the drugs. 
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Many physiological diseases are targeted by this novel pharmaceutical 
approach. One can name the autoimmune and inflammatory diseases, for 
example Addison's Disease, Alopecia Areata, Ankylosing Spondylitis, Behcet's 
Disease, Chronic Fatigue Syndrome, Crohn's Disease, Ulcerative Colitis, 
5 Inflammatory Bowel Disease, Diabetes and Multiple Sclerosis. 

As stated, cancers are also believed to be multigenic diseases. Some 
oncogenes (for exemple ras, c-myc) and tumor suppressor genes (for example 
p53) have previously been identified, as weU as some genetic markers for 
predisposition (for example the genes BRCAl and BRCA2 for breast cancer). 
10 The identification of new genes involved in other kinds of cancer should allow 
for a better information of the patient and the prevention of the development 
of the disease itself, an improved life expectancy as already observed with 
breast cancer (Schrag et al 2000. JAMA 283: 617-24). 

Knowledge of the identity of genes involved in cancer development 
15 therefore greatly facilitates the development of prophylactic, therapeutic and 
diagnostic methods for this disease. Diagnosis of the affected genes in a certain 
tumor type allows for the design of therapies comprising the use of specific 
anti-cancer drugs, for instance, drugs directed against the proteins encoded by 
these genes. 

20 Presently, only a limited number of genes involved in tumor 

development are known and there is a clear need for the identification of novel 
genes involved in tumor development to be used in the design of tumor-specific 
therapies and to define the genotype of a certain tvimor. 

In the research that led to the present invention, a number of genomic 

25 regions were identified to be involved in tumor development by proviral 

tagging. Proviral taggiag (Berns. 1988. Arch Virol. 102: 1-18; Kim et al 2003. J 
Virol. 77:2056-62) is a method that uses a retrovirus to infect normal 
vertebrate cells. After infection, the virus integrates into the genome thereby 
disrupting the local organization of the genome. This iategration is random 

30 and affects the expression or function of genes, depending on the integration 
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site of the virus, which may for instance be in a coding region, a regulatory 
region or a region nearby a gene. If a cellular gene involved in tumor 
development is affected, the cell will acquire a selective advantage to develop 
into a tumor as compared to cells in which no genes involved in tumor 
5 development are affected. As a result, all cells within the tumor originating 
from the cell affected in a gene involved in tumor development wiU carry the 
same proviral integration. Through analysis of the region nearby the retroviral 
integration site, the affected gene can be identified. Due to the size of the 
genome and the total number of integration sites investigated in the present 

10 invention, a gene that is affected in two or more independent tumors 

investigated must provide a selective advantage and therefore contribute to 
tumor development. Such sites of integration are designated as common 
integration sites (CIS). 

By using an improved method of proviral tagging the present inventors 

15 were able to identify a large nimiber of murine genomic regions involved in the 
development of myeloid leukemia in mice. All such genomic regions comprised 
common integration sites of the virus nucleic acid and such sites were either 
found inside, before or after genomic sequences that coded for known or 
putative murine genes, or in yet xmknown but defined positions within the 

20 murine genome. Many of the murine genomic regions, as defined by the 

presence therein of a common integration site, have so far not been reported to 
be involved in timior development. Novel cancer related murine genomic 
regions identified in this manner are the following: Adamll, AI462175, Cd24a, 
Edg3, Itgp, Kcnjie, KcnkS, Kcnn4, Ltb, Lyl08, Ly6i, mouse homologue of 

25 EMILIN, Mrcl, Ninj2, Nphsl, Sema4b, Tm9sf2, and Tnfrsfl 7, encoding cell 
surfkce proteins; Apobec2, Btd, Cds2, Clpx, Ddxl9, Ddx21, Dnmt2, Dqxl, 
HdacTa, Lce-pending, Mgatl, mouse homologue of CILP, mouse homologue of 
NOH61, Nudel'pending, Pah, Pdil, Ppia, Prpsl, Ptgds, and Vars2, encoding 
enzymes; Dagk4, mouse homologue of PSK, Nme2, Snfllk and Tyki, encoding 

30 kinases; DusplO, Inpp4a and InppSb, encoding phosphatases; ElS, Prg, and 
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Scya4, encoding secreted factors; Akap7, Api5, Arfrpl, ArhgapM-pending, 
CishS, Dappl, Fabp6, Fkbp8, Flizl-pending, Hint, lerS, Jundp2-pending, 
Lmo6. Midi, mouse homologue oiAKAPlS, mouse homologue otBIN2, mouse 
homologue of CEZANNE, mouse homologue of CHD2, moxise homologue of 
5 MBLL, mouse homologue oi SLC16A10, mouse homologue ot SLC16A6, mouse 
homologue of SLCl 7A5, mouse homologue of TAF5L, mouse homologue of 
UlSNRNPBP, mouse homologue of ZNF8, Mtap7, Myolc, Nkx2-3. Nsf, Pcdh9. 
Pkig, Prdx2, Pscdl, Psmbl, Psmel, Psme2, Rgll, Ril-pending, Saxl, Slcl4a2, 
Slc7al, Slc7all, Swap70, Txnip, and Ubl3, encoding signaling proteins; Clic3, 

10 Gtll-13, mouse homologue of NOL5A, axidydacB, encoding structural proteins; 
ABTl-pending, Ctbpl, Dermol, Ebf, Elf 4, Ldbl, mouse homologue otNRlDl, 
mouse homologue of ZER6. Rest, Tbp, Zfp238, Zfp287, and Zfp319, encoding 
proteins involved in transcriptional regulation; Lrrc2, Satbl, Slfn.4, and 
genomic regions with the following Celera identification codes mCGl0290, 

15 mCG10613, mCG11234, mGGll325, mCG11355, mCG11803, mCG11817, 
mCGl2566, mCG12630, mCGl2824, mCG13346, mCG14143, mCG14155, 
mCG14342, mCGl5141, mCGl5321„ mCGl6761, mCG16858, mCGl7127. 
mCGl7140, mCG17142, mCG17547, mCGl7569, mCGl7751, mCGl7799, 
mCG17802, mCGl7918, mCG18034„ mCG1850, mCGl8663, mCGl8737, 

20 mCG20276. mCQ20905, mCG20994, mCG21403, mCG21505, mCG21529, 
mCG21530. mCG21803, mCG22014, mCG22045, mCG22386, mCG2258, 
mCG22772, mCG23032, mCG23035, mCG23069, mCG23075, mCG23120, 
mCG2543, mCG2824, mCG2947, mCG3038, mCG3729, mCG3760, mCG50409, 
mCG50651, mCG5068, mCG5070, mOG51393, mCG52252, mCG52498, 

25 mCG53009, mCG53724, mCG55023, mCG55075, mCG55198, mCG55265, 
mCG55512, mCG56069, mCG56089. mCG56746. mCG57132, mCG57265, 
mCG57617, mCG57827, mCG58254, mCG58345, mCG5900, mCG5905, 
mCG59368, mCG59375, mCG59533. mCG59662, mCG59810, mCG59997, 
mCG60526, mCG60833, mCG61221, mCG61661, mCG61897, mCG61907, 

30 mCG61943, mCG62177, mCG62971, mCG63537, mCG63601, mCG64346, 
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mCG64382, mCG64398, mCG65022, inCG65585, mCG65785, mCG66128, 
mCG66379, mCG66776, inCG66965, mCG7831, mCG7856, mCG8424, 
mCG9002, inCG9537, mCG9538, mCG9791, mCG9792, mCG9843, inGG9875, 
mCG9877, and mCG9880. These genes are also listed in Table 1 below. 

5 

SUMMARY OF THE INVENTION 

A first object of the present invention is to provide novel genes involved 
in tumor development for use in the design of tumor-specific therapies. This 
object is, in the case of therapies for humans, achieved by using the human 
homologues of the murine genomic regions listed in Table 1 for the 
development of inhibitors directed against the genes encoded or affected by 
these regions that are involved in tumor development and/or their expression 
products and by using these inhibitors for the preparation of pharmaceutical 
compositions for the treatment of cancer, preferably for the treatment of 
leukemia. 

These murine genomic regions were discovered by using an improved 
method of proviral tagging involving a nested PGR approach as a result of 
which the cloning step as conventionally performed may now be omitted, 
resulting in a speeded up process of gene identification. Further, the improved 
method resulted in the identification of essentially only myeloid leukemia- 
related genes since the mice essentially only developed myeloid leukemia. Also 
the method supported the use of various murine letikemia viruses, which use 
surprisingly resulted in the identification of partly overlapping but otherwise 
essentially complementary sets of murine genomic regions involved in the 
development of murine myeloid leukemia. This then expands the possibilities 
of finding additional myeloid leukemia-related genes. 

Another improvement of the method of the present invention for the 
identification of genomic regions involved in the development of cancer, 
preferably of murine leukemia, resides in the use of an improved method for 
determining integration sites as being common. This method also involves a 
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powerful amplification procedure including nested PGR which is comhined 
with a nucleic acid blotting procedure, preferably Southern, as illustrated in 
Figure 2. 

The method of the invention for the identification of genomic regions 
5 involved in the development of cancer comprises a first step of performing 
retroviral insertional mutagenesis of a subject, comprising infecting said 
subject with a tumor inducing retrovirus. In a preferred embodiment, said 
subject is a mouse, preferably said mouse is an NIH-Swiss mouse or an FVB/N 
mouse and said retrovirus is Graffi murine 1.4 lexikemia virus (Gr-1.4) and/or 

10 Cas-BR-M murine leukemia virus (Cas-BR-M MuLV). The method of the 
invention comprises a subsequent step of isolating chromosomal DNA from 
primary tumors developed in the infected subject, such as present in e.g. 
spleen, liver, thymus, and lymph node tissue of mice with a vims-induced 
leukemia. The method of the invention comprises a subsequent step of 

15 digesting said chromosomal DNA with a restriction enzyme capable of cutting 
at least once in the genome of said tvimpr inducing retrovirus and at least once 
in the chromosomal DNA of said subject. Preferably, said chromosomal DNA is 
digested with a restriction enzyme recognizing restriction sites located at 
known positions within the virus LTR sequence but which wiU be imique 

20 within the genomic sequence in each virus insertion. Preferably, the restriction 
sites are separated such that a region of the chromosomal DNA of the subject 
flanking the known viral sequence can be identified based on the nucleotide 
sequence of said chromosomal DNA region, i.e. said region comprising a 
sufficient number of contiguous nucleotides to allow comparison thereof with a 

25 database of known sequences. 

The method of the invention comprises a subsequent step of ligating the 
digested DNA to circular DNA, followed by amplification of the chromosomal 
DNA fragment flanking the known viral sequence. In a method of the present 
invention, PGR amplification is carried out by performing a first PGR reaction 

30 with said circular DNA using a first set of retrovirus (LTR; long terminal 
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repeat)-specific primers followed by performing a second nested PGR with the 
product of said first PGR reaction using a second set of retrovirus (LTR)- 
specific primers. 

Following the step of amplifying the chromosomal DNA fragment 

5 flanking the known viral sequence, the nucleotide sequence of said 

chromosomal DNA fragment is determined, optionally via a cloning step, but 
preferably by performing a direct sequencing reaction, the latter being possible 
owing to the powerful and specific amplification procedures used. The 
determined nucleotide sequence of said chromosomal DNA firagment is then 

0 compared to a database of known sequences in order to identify the subject's 
genomic region wherein the retrovirus was integrated thereby indicating the 
genomic region potentially involved in the development of cancer. 

In order to determine whether an integration site is a common 
integration site, the method of the invention for the identification of genomic 

5 regions involved in the development of cancer comprises a step of designing 
genomic region-specific amplification primers, preferably in a nested format 
and said primers being capable of hybridizing to the chromosomal DNA 
firagment flanking the known viral sequence, the sequence of which firagment 
was determined as described hereinabove and which firagment represents a 

0 genomic region comprising a virus integration site. Then, RNA or DNA is 
isolated fi-om tumors to be analysed for the presence of a common integration 
site and an amplification reaction, preferably a nested (RT-)PCR reaction, is 
performed with one or more of the genomic region-^ecific primers (also 
termed locus specific primers herein) and one or more of the virus-specific 

5 primers, preferably retrovirus (LTR)-specific pirimers. Next, the amplification 
products are blotted and the blot is separately hybridized with a virus-specific 
probe and a genomic region-specific probe. Amplification products hybridizing 
with both probes are considered to represent common integration sites, such 
common integration sites indicating the genomic region involved in the 

3 development of cancer. 



wo 2004/016317 



PCT/NL2003/000S83 



10 

It is an aspect of the present invention to use tlie murine genomic 
regions of the present invention for the development of inhibitors directed 
against the genes encoded or affected by these genomic regions and/or their 
expression products. 
5 In one embodiment of the present invention, the inhibitors are 

antibodies and/or antibody derivatives directed against the expression 
products of genes encoded by the genomic regions or affected by genetic 
transformations in the genomic regions Usted in Table 1. Therapeutic 
antibodies are for instance useful against gene expression products located on 
10 the cellular membrane and can be comprised in a pharmaceutical composition. 
Also, antibodies may be targeted to intracellular, e.g. cjdioplasmic, gene 
products such as RNA's, polypeptides or enzymes, in order to modulate the 
activity of these products. Preferably, such antibodies are in the form of 
intrabodies, produced inside a cancer cell. In addition, antibodies may be used 
15 for deliverance of at least one toxic compoimd linked thereto to a tumor cell. 

In a preferred embodiment of the present invention, the inhibitor is a 
small molecule capable of modulating the activity or interfering with the 
function of the protein expression product of the genes encoded by the genomic 
regions or affected by genetic transformations in the genomic regions involved 
. 20 in tumor development as defined herein. In addition, small molectJes can also 
be used for deliverance of at least one hnked toxic compound to a tumor cell. 

On a different level of inhibition, nucleic acids can be used to block the 
production of proteins by destroying the mRNA transcribed ficom respective 
gene encoded by the genomic regions or affected by genetic transformations in 
25 the genomic regions. This can be achieved by antisense drugs, ribozymes or by 
RNA interference (RNAi). By acting at this early stage in the disease process, 
these drugs prevent the production of a disease-causing protein. The present 
invention relates to antisense drugs, such as antisense RNA and antisense 
oligodeoxynucleotides, ribozymes and RNAi molecules, directed against the 
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genes encoded by the genomic regions or affected by genetic transformations in 
the genomic regions listed in Table 1 or transcription products thereof 

The expression level of a gene can either be decreased or increased 
during tumor development. Naturally, inhibitors are used when the expression 
5 levels axe elevated. However, the present invention also provides for 

"enhancers", to boost the expression level of a gene encoded by the genomic 
regions or affected by genetic transformations in the genomic regions involved 
in tumor development and of which the expression levels are reduced. 
"Enhancers" may be any chemical or biological compoimd known or found to 
10 increase the expression level of genes, to improve the function of an expression 
product of a gene or to improve or restore the expression of a dysfunctional 
gene. 

Very suitable therapies to overcome reduced expression levels of a gene 
or to restore the expression of a dysfunctional gene encoded by the genomic 

15 regions or affected by genetic transformations in the genomic regions as 

disclosed herein iaclude the replacement by gene therapy of the dysfunctional 
or affected gene or its regulatory sequences that drive the expression of said 
gene. The invention therefore relates further to gene therapy, in which a 
dysfunctional gene of a subject encoded by the genomic regions or affected by 

20 genetic transformations in the genomic regions as listed in Table 1 or a human 
homologue thereof or a dysfunctional regulatory sequence of a gene of a subject 
. encoded by the genomic regions or affected by genetic transformations in the 
genomic regions as disclosed hsted in Table 1 or a human homologue thereof is 
replaced by a functional counterpart, e.g. by stable integration of for instance a 

25 lentiviral vector comprising a functional gene or regulatory sequence into the 
genome of a subject's host cell which is a progenitor cell of the tumor ceU-line 
of the subject and graftiag of said transfected host cell into said subject. 

The invention also relates to forms of gene therapy, in which the genes 
encoded by the genomic regions or affected by genetic transformations in the 

30 genomic regions listed ia Table 1 or a human homologue thereof are La. used 
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for the design of dominant-negative forms of these genes which inhibit the 
function of their wild -type coimterparts following their directed expression 
from a suitable vector in a cancer cell. 

Another object of the present invention is to provide a pharmaceutical 
5 composition for the treatment of cancer comprising one or more of the 
inhibitors, "enhancers", replacement compounds, vectors or host cells 
according to the present invention as a pharmaceutical reagent or active 
ingredient. The composition can further comprise at least one pharmaceutical 
acceptable additive like for example a carrier, an emulsifier, or a conservative. 

10 In addition, it is the object of the present invention to provide a method 

for treatment of subjects suffering from cancer which method comprises the 
administration of the pharmaceutical composition according to the invention to 
patients in need thereof in a therapeutically effective amount. 

A further aspect of the present invention relates to the use of the 

15 murine genomic regions or as disclosed herein as well as to human homologues 
thereof or their gene expression products for the development of reagents for 
the diagnosis of cancers. 

The invention also provides diagnostic compositions for diagnosing 
cancer, comprising the diagnostic reagents such as specific nucleic acid probes 

20 and/or specific antibodies capable of specifically binding to the murine genomic 
regions as disclosed herein as well as to human homologues thereof, to the 
transcription products of genes encoded thereby and/or to the expression 
products of such genes, respectively. 

Additionally, the present invention relates to methods for diagnosing 

25 cancer, preferably leukemia, by using a diagnostic composition of the present 
invention. 

In preferred aspects and embodiments of the present invention use is 
made of at least 2, preferably at least 4, more preferably at least 10, even more 
preferably at least 30 and most preferably all murine genomic regions selected 
30 from the group consisting of Adamll, Akap7, Arpgapl4 , Bomb, Cd24a, Cish2, 
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Cig5, CKc3, Cra, Dermol, EMILIN, Flj20489, GalntS, Hook, lerS, IL16, Iprgl, 
Itgp, Kcnk5, lrrc2, Ltb, MbU, Mrcl, Mtap7, Ninj2, Nrldl, Pcdh9, Prdx2, Prpsl, 
Pdil, Ptgd3, Rgll, Sardh, Scya4, Slcl6A6, SwapTO, Txnip, Triiii46, Tnfrsfl? 
and Ubl3 listed in Table 1. Using these genes very advantageous expression 
5 smalysis in acute myeloid leukemia (AML) patients and/or diagnostic methods 
of leukemia may be performed as each of these genes appeared selectively up 
or down regulated (on the mRNA level) in one or more specific subgroups of 
AML. 

Particularly preferred is the use in aspects and embodiments of the present 
10 invention of at least 2, preferably at least 3, more preferably at least 4, and 

most preferably all genes of the group of murine genomic regions consisting of 

Cd24a, Cish2, Cra, Ltb and Prdx2 Usted in Table 1. 

In addition to the newly foimd genomic regions and genes aflfected by 

transformations in such regions andof which the involvement in the 
15 development in tumor was hitherto unknown, use in aspects and embodiments 

of the present invention may also be made of gene sequences known to be 

involved in cancer development, such as for instance p53, Notch-1, Evi-1, NFl 

(Evi'2X Lck'l, Pim-1, HoxA9 (Evi-6), Fli-l, Yyl, Pps, Ptpnl and N-Myc, 



20 DEFINITIONS 

The term "murine genomic region" or "genomic region" as used herein 
indicates the site of integration of the nucleic acid of the virus used in proviral 
tagging of (mouse) genomic DNA. Such a viral integration site is involved in 
development of cancer when it is identified as a common integration site and 

25 may be located inside a known or putative gene, nearby, e.g. before or after a 
known or putative gene such as in a regxilatory sequence thereof, or in a yet 
unknown but defined position within the (murine) genome. As such, a "murine 
genomic region" herein indicates a gene, a putative gene, a region nearby a 
gene or putative gene, or a region of the murine genome the function of which 

30 is hitherto unknown. Genes comprised in such genomic regions or genes of 
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which the expression is aflGected by genetic transformations in such genomic 
regions are believed to be involved in cancer development as are the nucleotide 
sequences of the regions themselves and the expression products thereof. 
"Expression products" may be protein and/or RNA. 
5 A "murine genomic region-related gene" as used herein is a gene which 

is ftdly or partially comprised in a murine genomic region of the present 
invention or a gene affected by transformations therein, i.e. in general; a gene 
which function or expression is aflfected by a transformation in a mxirine 
genomic region of the invention. 

10 The term "target cell" as used herein indicates a eukaryotic cell, 

preferably an animal (including human), more preferably a mammalian ceU, 
iacluding a specialized tissue ceU and progenitor cell thereof, which is capable 
of being infected or transfected by vector of the invention, or which is the 
target for a diagnostic reagent of the present invention. 

15 The term "producer cell" as used herein refers to a cell or cell-line, 

respectively, suitable for rephcation, propagation, and/or production, of a 
vector of the invention. 

The term "host cell" as used herein generally indicates a cell used for the 
expression of a viral genome, or propagation of a vector or virus and, in the 

20 context of the present invention, includes both the target and producer cell. 

The term "viral vector" refers to a nucleic acid construct comprising a 
viral genome capable of being transcribed in a host cell, which genome 
comprises sufficient viral genetic information to allow packaging of the viral 
RNA genome, in the presence of packaging components, into a viral particle 

25 capable of infecting a target cell. Infection of the target cell includes reverse 
transcription and integration into the target cell genome, where appropriate 
for particular viruses. 

The term "isolated nucleic acid sequence" means a nucleic acid sequence 
that is free of the nucleotide sequences that flank the nucleic acid sequence in 

30 the naturally-occm-ring genome of the organism from which it is derived. The 
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term therefore includes, for example, any recombinant DNA which is 
incorporated into a vector; into an autonomously replicating plasmid or into 
the genomic DNA of a prokaryote or eukaryote from which it has not been 
derived; or which exists as a separate molecule (e.g., an RNA, a cDNA or a 
5 genomic or cDNA fragment produced by PGR or restriction endonuclease 

digestion) independent of other sequences. It also includes a recombinant DNA 
which is part of a hybrid gene encoding additional poljrpeptide sequences. 

As used herein, "polynucleotide", "nucleic acid" or "oligonucleotide" 
includes reference to a deoxyribonucleotide or ribonucleotide poljoner in either 

10 single-or double-stranded form, and unless otherwise limited, encompasses 
known analogues of natural nucleotides that hybridize to nucleic acids in a 
manner similar to naturally occurring nucleotides. In specific embodiments, 
the "polynucleotide", "nucleic acid" or "oligonucleotide" can be substituted by 
chemical substances that can form sequence-specific interactions similar as for 

15 the natural phosphodiester "nucleic acid". Kiiown and preferred analogues 

include polymers of nucleotides with phosphorothioate or methylphosphonate 
liaisons, or peptide nucleic acids. Unless otherwise indicated, a particular 
nucleic acid sequence includes the complementary sequence thereof. 

A coding sequence is "imder the control" of regulatory sequences in a cell 

20 when RNA polymerase transcribes the coding sequence into mRNA encoded by 
the coding sequence. 

The term "coding sequence" as defined herein is a sequence which is 
transcribed into mRNA and optionally translated into a polypeptide when 
placed xmder the control regulatory sequences. The boundaries of the coding 

25 sequence are generally determined by a translation start codon ATG at the 5'- 
terminus and a translation stop codon at the 3*-terminus. A coding sequence 
can include, but is not limited to, RNA, DNA, cDNA, and recombinant nucleic 
acid sequences. 

The term "regxdatory sequences" is defined herein to include aU 
30 components which are necessary or advantageous for expression of a coding 
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sequence. Each regulatory sequence may be native or foreign to the coding 
sequence. Such regulatory sequences include, but are not limited to, a leader, a 
polyadenylation sequence, a propeptide sequence, a promoter, a signal 
sequence, and a transcription terminator. At a minimum, the regulatory 
5 sequences include a promoter, and transcriptional and translational stop 
signals. The regulatory sequences may be provided with linkers for the 
purpose of introducing specific restriction sites facOitating hgation of the 
regulatory sequences with the coding region of the nucleic acid sequence 
encoding a polypeptide. 

10 The term "promoter" is used herein for its art-recognized meaning to 

denote a portion of a gene containing DNA sequences that provide for the 
binding of RNA polymerase and initiation of transcription. Promoter sequences 
are commonly, but not always, found in the 5' non-coding regions of genes. 
The term "packaging components" refers to building blocks which are 

15 necessary for encapsulation of nucleic acid of a virus into mature viral 
particles. 

The terms "nucleotide sequence homology" or "sequence homology" or 
"homologous sequence" as used herein denote the presence of homology 
between two or more polynucleotides. Polynucleotides have "homologous" 

20 sequences if the sequence of nucleotides in their sequences is the same when 
aligned for maximum correspondence. Sequence comparison between two or 
more polynucleotides is generally performed by comparing portions of two 
sequences over a comparison window to identify and compare local regions of 
sequence similarity. The comparison window is generally &om about 20 to 200 

25 contiguous nucleotides. The "percentage of sequence homology" for 

poljmucleotides, such as 50, 60, 70, 80, 90, 95, 98, 99 or 100 percent sequence 
homology may be determined by comparing two optimally aligned sequences 
over a comparison window, wherein the portion of the polynucleotide sequence 
in the comparison window may include additions or deletions (i.e. gaps) as 

30 compared to the reference sequence (which does not comprise additions or 
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deletions) for optimal aligniiient of the two sequences. The percentage is 
calcvilated by: (a) determining the nxmiber of positions at which the identical 
nucleic acid base occurs in both sequences to yield the number of matched 
positions; (b) dividing the number of matched positions by the total number of 
5 positions in the window of comparison; and (c) multiplying the result by 100 to 
yield the percentage of sequence homology. Optimal ahgnment of sequences for 
comparison may be conducted by computerized implementations of known 
algorithms, or by visual inspection. Readily available sequence comparison and 
multiple sequence ahgnment algorithms are, respectively, the Basic Local 

10 Alignment Search Tool (BLAST) (Altschul, S.F. et al. 1990, J. MoL Biol. 
215:403; Altschtd, S.F. et al. 1997. Nucleic Acid Res. 25:3389-3402) and 
ClustalW programs both available on the internet. Other suitable programs 
include GAP, BESTFIT and FASTA in the Wisconsin Genetics Software 
Package (Genetics Computer Group (GOG), Madison, WI, USA). 

15 As used herein, the term "substantially homologous" for nucleic acid 

sequences should be interpreted as two nucleic acid sequences having a 
"percentage of sequence homology" of at least 40, preferably at least 60, more 
preferably at least 80, even more preferably at least 90, still more preferably at 
least 95, still more preferably at least 98, and most preferably at least 99 

20 percent nucleotide sequence homology. 

As used herein, the term "human homologue" should be interpreted as a 
human (putative) gene having the same function as the (putative) gene 
identified in mouse. A "human homologue" refers with respect to its respective 
miirine gene as described in the present invention to a hum an gene which 

25 encodes an amino acid sequence which shows more than 45 %, preferably more 
than 60 %, more preferably more than 70 %, stUl more preferably more than 80 
%, even more preferably more than 90 % and most preferably more than 95 % 
identity with the amino acid sequence derived jfrom the murine genes of the 
invention and is in the case of an enzyme the amino acid sequence of a 

30 polypeptide which shows the same type of enzymatic activity as the enzjrme 
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encoded murine genes of the invention. A "human homologue" of a murine 
genomic region, not being a (putative) gene, as defined herein indicates a 
genomic region within the hximan genome that, when genetically transformed, 
results in an aberrant function of a gene which is identical in human and 
5 mouse. 

As used herein, the terms "antibody" and "antibodies" refer to 
monoclonal antibodies, multispecific antibodies, synthetic antibodies, human 
antibodies, humanized antibodies, chimeric antibodies, single-chain Fvs (scFv), 
single chain antibodies, Fab fragments, F(ab') fragments, disulfide -linked Fvs 

10 (sdEV), and anti-idiotypic (anti-Id) antibodies (including, e.g., anti-Id 

antibodies to antibodies of the invention), and epitope-binding fragments of 
any of the above. In particular, antibodies of the present invention include 
immunoglobulin molecules and immunologically active portions of 
immunoglobulin molecules, i.e., molecules that contain an antigen binding site 

15 that immunospecificaUy binds to a polypeptide antigen encoded by a gene 

comprised in the genomic regions or affected by genetic transformations in the 
genomic regions listed in Table 1. The immunoglobulin molecules of the 
invention can be of any type {e.g., IgG, IgE, IgM, IgD, IgA and IgY), class (e.^., 
IgGi, IgGa, IgGs, IgG4, IgAi and IgA2) or subclass of immunoglobulin molecule. 

20 The term "oligonucleotide array" refers to a substrate having a two- 

dimensional surface having at least two different features. Oligonucleotide 
arrays preferably are ordered so that the localization of each feature on the 
surface is spotted. In preferred embodiments, an array can have a density of at 
least five hundred, at least one thousand, at least 10 thousand, at least 100 

25 thousand features per square cm. The substrate can be, merely by way of 
example, glass, silicon, quartz, polymer, plastic or metal and can have the 
thickness of a glass microscope slide or a glass cover shp. Substrates that are 
transparent to hght are useful when the method of performing an assay on the 
chip involves optical detection. As used herein, the term also refers to a probe 

30 array and the substrate to which it is attached that form part of a wafer. The 



wo 2004/016317 



PCT/NL2003/000583 



19 

substrate can also be a membrane made of polyester or nylon. In this 
embodiment, the density of features per square cm is comprised between a few 
units to a few dozens. 

"Modulating" according to the present invention should be understood 
5 as regulating, controlling, blocking, inhibiting, stimulating, enhancing, 

activating, mimicking, bypassing, correcting, removing, and/or substituting 
said gene expression or said route, in more general terms, intervening in said 
gene expression or said route. 

"Subject" as used herein includes, but is not limited to, an organism; a 
10 mammal, including, e.g., a human, non-human primate, mouse, pig, cow, goat, 
cat, rabbit, rat, guinea pig, hamster, horse, monkey, sheep, or other non- 
human mammal; and a non-mammal, including, e.g., a non-mammalian 
vertebrate, such as a bird (e.g., a chicken or duck) or a fish, and a non- 
mammalian invertebrate. 

15 

LEGEND TO THE FIGURES 

Figure 1 is a graphic representation of a directed PGR on chromosomal 
DNA to determine commonality of inverse PGR identified virus integration site 
as described in Example 3. Herein, primers XI, XIN, X2 and X2N are designed 

20 in a locus or genomic region that was identified as a virus integration site by 
inverse PGR. To amphfy flanking genomic sequences and to determine the 
locaUzation and orientation of the integrated provirus, 4 different nested PCRs 
were performed. First, the primers XI and X2 were combined with the Graffi- 
1.4 MuLV LTR primers LI and L2. These products were amplified by a nested 

25 PGR, using the primers XIN and X2N in combination with LIN and L2N. The 
specificity of the amplified bands was checked using probes PI and P2 in 
Southern blot analysis. 

Figure 2 illustrates the method of the present invention for the 
identification of disease loci. IPCR or RT-PCR was performed on RNA or DNA 

30 from cell lines (DA and NFS), or GSL leukemias. The resulting virus flanking 
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fragments were subjected to sequence analysis. LTR- sind locus-specific 
primers were designed and used in a nested PGR strategy (*), i.e. LTR- 
1/primer-A PGR, followed by a LTR-2/primer-B amplification using genomic 
DNA from a panel of CasBr-M MuLV-induced leukemias. PGR products were 
5 electrophoresed on a 1.5% agarose gel and subsequently blotted. Blots were 
first hybridized with locus-specific probe C and exposed to film, and after 
stripping rehybridized with probe LTR3. Bands hybridizing with the LTR3 as 
well as the locus specific probe C (lanes 1, 2, 3, 6 and 8) were considered 
positive, i.e. tumors carrying this particular common viral integration site 
10 (cVIS). Hybridization with one primer only (lanes 4 and 7) are false positives. 
In each experiment two or three positive fragments were cloned and nucleotide 
sequenced to confirm specificity 

DETAILED DESCRIPTION OF THE INVENTION 

15 As outlined above, knowledge of the identity of genes involved in cancer 

development greatly facilitates the development of prophylactic, therapeutic 
and diagnostic methods for this disease. The discovery of a great number of 
additional cancer genes and potential cancer genes now provides for the tools 
to detect and treat cancer and to provide for prophylactic, therapeutic and 

20 diagnostic compositions useful for treating subjects suffering from cancer or 
diagnosing subjects suspected of having the disease or even diagnosing the 
severity or type of the disease. 

In a preferred embodiment, the present invention is aimed at providing 
methods, substances, compositions, and uses according to the invention for the 

25 treatment of leukemia, even more preferably for the treatment of acute 
myeloid leukemia (AML). 

Leukemia is a type of cancer that originates in the bone marrow where, 
the blood forming cells (hematopoietic stem cells and progenitor cells) in adults 
reside. Leukemia is an uncontrolled production and accumulation of cancerous 

30 blood cells that have usually partly or completely lost their ability to develop 
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into differentiated and functional blood cell types. Lenkemias can be divided in 
clinically distinct categories with largely different prognosis and reqiiirements 
for the type of treatment. Based on clinical and cytological parameters, 
leukemias are grossly classified in the following categories: chronic myeloid 
5 leukemia (CML), chronic lymphocytic leukemia (CLL), acute myeloid leukemia 
(AML) and acute lymphoblastic leukemia (ALL). The clinico-pathological 
characteristics of these types of leukemia are distinct and require different 
forms of treatment with variable aggressiveness and treatment-related 
morbidity and mortality. 

10 In the past two decades, a further refinement of the above-mentioned 

classification has been developed based on immunological, cytogenetic and, 
more recently, molecular genetic parameters. This refijaement has resulted in 
improvements in treatment selection and a better prediction of the prognosis 
and treatment responses of patients and has formed the basis for the 

15 categorization of leukemia patients into good risk, standard risk and poor risk 
categories. However, these categories are still heterogeneous and further 
refinement is urgently required. 

In addition, further insights in the pathogenetic and pathophysiological 
mechanisms underlying this heterogeneity is urgently needed as it would 

20 provide an invaluable source of information for the development of new 

therapies aimed at specific pathogenetic mechanisms. This will not only resiilt 
in reduced mortality and morbidity with equally efficient therapeutic impact, 
but also in the development of curative therapies for patients that are 
refiractory to the currently available treatment options for leukemia. 

25 Many of the genes affected by retroviral integration, as described in 

more detail in the examples below, appeared to be related to signaling, some of 
which have already been linked to hmnan levdcemia. For instance, in case of 
the Graffi-1.4 MxiLV, the Tie-1 gene, which encodes a tyrosine kinase receptor 
that is normally expressed in vascular endothelial cells and hematopoietic 

30 stem cells, is overexpressed in chronic myeloid leukemia (CML). Importantly, 
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high Tie-1 levels inversely correlate with siirvival of CML patients in early 
chronic phase. Strongly increased levels of Tie-1 were also detected in bone 
marrow samples from MDS (Myelodysplastic Syndromes) and AML patients. 
Notch-1 is another example of a CIS associated with human disease. The 
human homologue of the Notch (Tan-1) gene is involved in chromosomal 
translocation t(7;9)(q34;34.3) which results in a tnmcated receptor in human 
T-lymphoblastic neoplasms. Notch-1 is a pro viral integration site in mouse 
lymphoid leukemias . Our data suggest that aberrant Notch signaling may 
also be involved in myeloid leukemia. Notably, the integrations in the Notch 
gene predictably result in the constitutive formation of the truncated active 
form of Notch, which interferes with G-CSF-induced myeloid differentiation in 
the 32D cell model. 

Thus, the newly detected involvement of the gene sequences or murine 
genomic regions as disclosed herein, allows for the identification of novel 
routes of disease development and for a better xmderstanding in the 
cooperative action between genes in pathogenesis of cancer, in particular of 
leukemia and thus for a better diagnosis and treatment of the disease. 

The implications of the findings reported herein are not limited to 
murine cancers. Many of the murine genomic regions identified herein also 
play an important role in cancer development in humans and other mammals. 
Examples of genes that are involved in mouse as well as human leukemia are 
for instance the tumor suppressor gene Evi2/Nfl (Buchberg et aL^ Mol Cell 
Biol, 10, 4658-4666, 1990; Shannon aZ., NEnglJMed, 330, 597-601, 1994; 
Side et al.. Blood, 92, 267-722, 1998), or the oncogenes Nmyc-1 (Hirvonen et aL 
Leuk Lymphoma, 11, 197-205, 1993; Setoguchi et aL, Mol Cell Biol, 9, 4515- 
4522, 1989), Evil (Morishita et al. Oncogene Res, 5, 221-231, 1990; Morishita 
Cell, 54, 831-840, 1988), Evi6/Hoxa9 (Nakamura et aL, Nat Genet, 12, 154-158, 
1996; Nakamura et aL, Nat Genet, 12, 149-153, 1996), Bcll/CyclinDl (de Boer 
et aL, 1997 Ann Oncol, 8, 109-117; Silver & Buckler, J Virol, 60, 1156- 
1158,1986), Erg (Shimizu et aL, Proc Natl Acad Sci USA, 90, 10280-10284 
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1993; Valk et al, Nucleic Acids Res, 25, 4419-4421, 1997) or spi-l/Pu.l 
(Mueller BU et al., Blood 101(5):2074, 2003). An overview of currently known 
mxirine cancer genes identified in retroviral screens can be found on the 
internet address littp://genome2.ncifcrf.gov/RTCGD/ . Likewise, it is therefore 
5 predicted that a large number of the genes as disclosed herein is involved in 
cancer development in humans as well. 

Mutations are frequently observed in genes encoding growth factors, 
growth factor receptors and other membrane proteins, kinases, phosphatases 
and other regulatory enzymes, transcription regulators and proteins critical in 

10 the process of sxirvival and apoptosis. Multiple "leukemia genes" have been 
demonstrated to be disease genes responsible for the development of certain 
other types of malignancies as well, e.g. Ras (Bos JL, et al Blood. 69:1237- 
41,1987; , Gahana C. et al., Mol Carcinog. 14:286-93, 1995; van 't Veer et al., 
Oncogene 2:157-65, 1988), p55 (Carson DA, Lois A., Lancet. 14;346(8981):1009- 

15 11, 1995). NF'l (McLaughhn ME, Jacks T, Methods Mol Biol. 222: 223-237, 
2003; Nishi T, Saya H., Cancer Metastasis Rev. 10: 301-310, 1991) or C-kit 
(Rubin BP et al, Cancer Res. 61 :8118-8121, 2001). 

Acute myeloid leukemia (AML) is the most frequent form of acute 
leukemia in adults and is one of the most aggressive forms of leukemia, which 

20 is acutely life threatening unless treated with different kinds of chemotherapy. 
Depending on the AML subtype determined by various clinical parameters, 
including age, and laboratory findings, for instance cytogenetic features, 
allogeneic stem cell transplantation might follow the remission induction by 
chemotherapy. The 5 years overall and disease firee survival rate of adult AML 

25 is currently in the order of 35-40%. There is a strong need for a more precise 
diagnosis of AML, which allows for better distinction between the prognostic 
subtypes and for new therapeutic strategies for the large contiagent of patients 
that can not be cured to date. The cxu-rently available laboratory techniques 
allow for a prognostic classification, but this is still far from optimal. Still, 
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most patients cannot satisfactorily be risk-stratified and still a majority of 
patients are not cured by currently available treatment modalities. 

The pathogenesis of leukemia is complex. Before becoming clinically 
overt, leukemic cells have acquired multiple defects in regulatory genes that 
5 control normal blood cell production. In human leukemia, xmtil now only few of 
these genes have been identified, mainly by virtue of the fact that these genes 
were located in critical chromosomal regions involved in specific chromosome 
translocations found in human AML. Studies in mice, particularly those 
involving retroviral tagging, have yielded only relatively small numbers of 

10 retroviral insertions and target genes per study, but have nonetheless made 
clear that there are at least a few hundred genes that can be involved in the 
pathogenesis of murine leukemia. Based on the strong conservation between 
the mouse and human hematopoietic systems, as is for instance evident firom 
the fact that the biological properties of the hematopoietic progenitor cells and 

15 the regxilators (hematopoietic growth factors) are largely similar, it is very 
likely that a substantial proportion of the genes (or their close fomily 
members) implicated in mouse leukemia, can also be involved in the 
development of human disease. 

In order to substantiate the claim that the presently disclosed genes 

20 play an important role ia human lexikemia we investigated 288 cases of human 
AML, These consisted of AML cases with internal tandem duplications in the 
gene encoding the tyrosine kinase receptor FLT3 (FLT3-ITD), which feature is 
associated with a poor prognosis, as well as AML cases with a t(8;21) 
translocation or patients with a t(15;17) translocation, which features fall 

25 within the favourable risk categories. For 126 of the 237 genes as disclosed 
herein, a study for their potential involvement said 288 cases of human AML 
was performed. The study included the determinations of 1) whether the genes 
coxdd be detected, 2) whether the expression of the genes deviated significantly 
from expression levels in normal human bone marrow ceUs, in particular the 

30 immature (CD34+) subset, and 8) to what extent these genes might be 
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differentially expressed in the most significant prognostic subgroups of AML 
that are defined to date. The resiilts of these determinations indicated that the 
genes as disclosed herein provide for additional diagnostic power in detecting 
as well as in differentiating AML. 
5 Although a large number of the murine genomic regions as disclosed in 

the present invention, i.e. in Table 1, encode known genes, i.e. encode for 
known murine genes, the relevance of these genes in the development of 
myeloid leukemia was not previously recognized. 

Fxirthermore, the remainder of the murine genomic regions involved in 

10 tumor development as disclosed in the present invention comprise unknown 
nucleic acid sequences, i.e. do not show any significant homology to sequences 
that encode known murine genes as available in present-day databases and 
comprise i.a. putative gene sequences. Nonetheless, the involvement of these 
murine genomic regions in the development of myeloid leukemia was 

15 demonstrated for all nucleic acid sequences as hsted in Table 1. 

The finding that these murine genomic regions are involved in the 
development of myeloid leukemia does not preclude their role in other forms of 
leukemia or even other cancers. Therefore, the murine genomic regions as 
Hsted in Table 1 may also play an important role in cancers such as bone 

20 cancers, brain tumors, breast cancer, endocrine system cancers, 

gastrointestinal cancers, gynaecologic cancers, head and neck cancers, lung 
cancers, l3anphomas, metastases, myelomas, pediatric cancers, pemle cancer, 
prostate cancer, sarcomas, skin cancers, testicular cancer, thyroid cancer and 
urinary tract cancers. 

25 The present invention provides in one aspect for nucleic acid sequences 

suitable for therapeutic and/or diagnostic use. 

Murine genomic regions disclosed in the present invention are useful for 
developing therapeutic and diagnostic methods. For example, a diagnostic 
assay to determine the stage of the disease may be developed and may also be 

30 useful in tailoring the treatment of aggressive versus milder cancer or tumor 
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progression. For instance it is known that the t(15;17) translocation results in 
a mutation in the gene encoding a all- trans retinoic acid (ATEA) receptor in 
differentiating blood ceUs and results in an impaired ATRA receptor activity, 
required for proper differentiation. As a result, differentiation of the ceUs is 

5 impaired leading to leukemia. A well accepted treatment is currently the 
administration of high doses of ATRA in order to allow the cells to 
differentiate. This demonstrates that knowledge of the molecular mechanism 
of cancer may provide important implications for therapy. 

The murine genomic regions, genes comprised therein and pol5^eptides 

0 encoded thereby as well as genes and their encoded pol3T)eptides affected by 
transformations in murine genomic regions of the invention are useful for 
designing diagnostic reagents useful in such methods. 

Further, modulation of genes or gene expression products that are mis- 
regtdated can be used to treat or ameliorate cancer, tumor progression, 

5 hyperproUferative cell growth, and the accompanying physical and biological 
manifestations. For example, the murine genomic regions provided herein, can 
be used to construct nucleic acid and polypeptide compositions useful for 
treatment, such as antisense, ribozsrmes, antibodies, vaccine antigens, and 
immune system inducers, for example. Thus, the present invention relates to 

0 methods and reagents for diagnosis, and to methods and compositions for 
treatment. 

In one aspect, use is provided of nucleic adds having a sequence of one 
or more of the cancer-related genes as disclosed herein to obtain full-length 
cDNA and full-length human gene and promoter regions. One such use is the 
;5 generation of the polypeptides encoded by the gene sequence. 

Polypeptide production. 

The polynucleotide, the corresponding cDNA, or the full-length gene 
encoded in a mxirine genomic region of the invention may be identified by 
10 using computer algorithms for the identification of open reading frames 



wo 2004/016317 



PCT/NL2003/000583 



(ORFs) therein. Subsequently, full length cDNAs comprising the complete or 
partial coding sequences may be prepared synthetically and ligated into a 
suitable expression system to yield a polynucleotide construct of the invention. 
Appropriate poljniucleotide constructs are pxirified using standard 
5 recombinant DNA techniques as described in, for example, Sambrook et aL, 
(1989) Molecvdar Cloning: A Laboratory Meinual, 2nd ed. (Cold Spring Harbor 
Press, Cold Spring Harbor, N.Y.). The polypeptides encoded by the 
polynucleotides are expressed in any expression system, including, for 
example, bacterial, yeast, insect, amphibian and mammahan systems. Smtable 

10 vectors and host cells are described in U.S. Pat. No. 5,654,173. 

Expression systems in bacteria include those described in Chang et al., 
Nature (1978) 275:615, Goeddel et aL, Nature (1979) 281:544, Goeddel et al., 
Nucleic Acids Res. (1980) 8:4057; EP 0 036,776, U.S. Pat. No. 4,551,433, 
DeBoer et al., Proc. Natl. Acad. Sci. (USA) (1983) 80:21-25, and SiebenHst et 

15 al., CeU (1980) 20:269. 

Expression systems in yeast include those described in Hinnen et al., 
Proc. Natl. Acad. Sci. (USA) (1978) 75:1929; Ito et al., J. BacterioL (1983) 
153:163; Kurtz et al., Mol. Cell. Biol. (1986) 6:142; K\uize et al., J. Basic 
Microbiol. (1985) 25:141; Gleeson et al., J. Gen. Microbiol. (1986) 132:3459, 

20 Roggenkamp et al., Mol. Gen. Genet. (1986) 202:302) Das et al., J. BacterioL 

(1984) 158:1165; De Louvencourt et al., J. Bacterid. (1983) 154:737, Van den 
Berg et al., Bio/Technology (1990) 8:135; Kunze et al., J. Basic Microbiol. 

(1985) 25:141; Gregg et al., Mol. CeU. Biol. (1985) 5:3376, U.S. Pat. Nos. 
4,837,148 and 4,929,555; Beach and Nurse, Nature (1981) 300:706; Davidow et 

25 al., Curr. Genet. (1985) 10:380, Gaillardin et al., Curr. Genet. (1985) 10:49, 

BaUance et al., Biochem. Biophys. Res. Commim. (1983) 112:284-289; Tilbum 
et al., Gene (1983) 26:205-221, Yelton et al., Proc. Natl. Acad Sci. (USA) (1984) 
81:1470-1474, Kelly and Hynes, EMBO J. (1985) 4:475479; EP 0 244,234, and 
WO 91/00357. 



wo 2004/016317 



PCT/NL2003/000583 



28 

Expression of heterologous genes in insects is accomplished as described 
in U.S. Pat. No. 4,745,051, Priesen et al. (1986) "The Regulation of Baculovirus 
Gene Expression" in: The Molecular Biology Of Baculoviruses (W. Doerfler, 
ed.), EP 0 127,839, EP 0 155,476, and Vlak et al., J. Gen. Virol. (1988) 69:765- 
5 776, Miller et al., Ann. Rev. Microbiol. (1988) 42:177, Carbonell et al., Gene 
(1988) 73:409, Maeda et al.. Nature (1985) 315:592-594, Lebacq-Verheyden et 
al., MoL Cell. Biol. (1988) 8:3129; Smith et al., Proc. Natl. Acad. Sci. (USA) 
(1985) 82:8404, Miyajima et al.. Gene (1987) 58:273; and Martin et al., DNA 
(1988) 7:99. Numerous baculoviral strains and variants and corresponding 

10 permissive insect host cells from hosts are described in Luckow et al., 

Bio/Technology (1988) 6:47-55, Miller et al., Generic Engineering (Setlow, J. K, 
et al. eds.), Vol. 8 (Pleniun Pubhshing, 1986), pp. 277-279, and Maeda et al., 
Nature, (1985) 315:592-594. 

Mamjnahan expression is accomplished as described in Dijkema et al., 

15 EMBO J. (1985) 4:761, Gorman et al., Proc. Natl. Acad. Sci. (USA) (1982) 

79:6777, Boshart et al., CeU (1985) 41:521 and U.S. Pat. No. 4,399,216. Other 
features of mammalian expression are facilitated as described in Ham and 
Wallace, Meth. Enz. (1979) 58:44, Barnes and Sato, Anal. Biochem. (1980) 
102:255, U.S. Pat. Nos. 4,767,704, 4,657,866, 4,927,762, 4,560,655, WO 

20 90/103430, WO 87/00195, and U.S. Pat. No. RE 30,985. 

Polynucleotide molecules or parts thereof comprising the murine 
genomic regions listed in Table 1 or comprised therein are propagated by 
placing the molecule in a vector. Viral and non-viral vectors are used, 
including plasmids. The choice of plasmid will depend on the type of cell ia 

25 which propagation is desired and the purpose of propagation. Certain vectors 
are useful for amplifying and making large amounts of the desired DNA 
sequence. Other vectors are suitable for expression in cells in culture. Still 
other vectors are suitable for transfer and expression in cells in a whole animal 
or person. The choice of appropriate vector is well within the skill of the art. 

30 Many such vectors are available commercially. The polsmucleotide is inserted 
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into a vector typically by megms of DNA ligase attachment to a cleaved 
restriction enzyme site in the vector. Alternatively, the desired nucleotide ' 
sequence may be inserted by homologous recombination in vivo. Typically this 
is accomplished by attaching regions of homology to the vector on the flanks of 
5 the desired nucleotide sequence. Regions of homology are added by ligation of 
ohgonucleotides, or by polymerase chain reaction using primers comprising 
both the region of homology and a portion of the desired nucleotide sequence, 
for example. 

Polynucleotides are hnked to regulatory sequences as appropriate to 
10 obtain the desired expression properties. These may include promoters 

(attached either at the 5* end of the sense strand or at the 3' end of the 

antisense strand), enhancers, terminators, operators, repressors, and inducers. 

The promoters may be regulated or constitutive. In some situations it may be 

desirable to use conditionally active promoters, such as tissue-specific or 
15 developmental stage-specific promoters. These are linked to the desired 

nucleotide sequence \ising the techniques described above for linkage to 

vectors. Any techniques known in the £irt may be used. 

When any of the above host cells, or other appropriate host cells or 

organisms, are used to replicate and/or express the polynucleotides or nucleic 
20 acids of the invention, the resulting replicated nucleic acid, RNA, expressed 

protein or polypeptide, is within the scope of the invention as a product of the 

host cell or organism. The product is recovered by any appropriate means 

known in the art. 

Once the gene corresponding to a polypeptide is identified, its 
25 expression can be regulated in the cell to which the gene is native. For 

example, an endogenous gene of a cell can be regulated by an exogenous 

regulatory sequence as disclosed in U.S. Pat. No. 5,641,670, "Protein 

Production and Protein Delivery." 

30 Identification of Secreted and Mem brane-Bound Polypeptides 
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Both secreted and membrane-bound polypeptides encoded in murine 
genomic regions or affected thereby in any way sire of interest. For example, 
levels of secreted polypeptides can be assayed conveniently in body fluids, such 
as blood or urine. Membrane-bound polj^ep tides are useful for constructing 
5 vaccine antigens, for inducing an immime response, or for diagnosis of whole 
cells. Such antigens would comprise all or part of the extracellular region of 
the membrane-bound polypeptides. 

Because both secreted and membrane-bound polypeptides comprise a 
fragment of contiguous hydrophobic amino acids, hydrophobicity predicting 
10 algorithms can be used to identify such polypeptides. 

A signal sequence is usually encoded by both secreted and membrane- 
bound polypeptide genes to direct a polypeptide to the surface of the cell. The 
signal sequence usually comprises a stretch of hydrophobic residues. Such 
signal sequences can fold into helical structures. 
15 Membrane-bound polypeptides typically comprise at least one 

transmembrane region that possesses a stretch of hydrophobic amino acids 
that can transverse the membrane. Some transmembrane regions also exhibit 
a helical structure. 

Hydrophobic fragments within a pol3rpeptide can be identified by using 
20 computer algorithms. Such algorithms include Hopp & Woods, Proc. Natl. 
Acad. Sci. USA 78: 3824-3828 (1981); Kyte & DooUttle, J. MoL Biol. 157:105- 
132 (1982); and RAOAR algorithm, Degli Esposti et al., Eur. J. Biochem. 190: 
207-219 (1990). 

Another method of identifying secreted and membrane-bound 
25 poljTpeptides is to translate the present poljmucleotides, listed in Table 1, and 
determine if at least 8 contiguous hydrophobic amino acids are present. Those 
translated polypeptides with at least 8; more typically, 10; even more typically, 
12 contiguous hydrophobic amino acids are considered to be either a putative 
secreted or membrane bound polypeptide. Hydrophobic amino acids include 
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alanine, glycine, histidine, isoleucine, leucine, lysine, methionine, 
phenylalanine, proline, threonine, tryptophan, tyrosine, and vaUne. 

Construction of Polypeptides of the Invention and Variants Thereof 

The polypeptides useful for the development of therapeutic reagents 
iaclude those encoded by the nucleotide sequences of genes or putative genes 
as disclosed herein. These nucleotide sequences can also be encoded by nucleic 
acids that, by virtue of the degeneracy of the genetic code, are not identicsll in 
sequence to the disclosed nucleotide sequences. Thxis, the invention includes 
within its scope isolated nucleic acids comprising a nucleotide sequence 
encoding a protein or polypeptide expressed by said isolated nucleic acid 
having the sequence of any one of the gene sequences listed in Table 1 or 
having a sequence substantially homologous thereto. Also within the scope of 
the invention are variants; variants of polypeptides include mutants, 
fragments, and fusions. Mutants can include amino acid substitutions, 
additions or deletions. The amino acid substitutions can be conservative amino 
acid substitutions or substitutions to eliminate non-essential amino acids, such 
as to alter a glycosylation site, a phosphorylation site or an acetylation site, or 
to minimize misfolding by substitution or deletion of one or more cysteine 
residues that are not necessary for function. Conservative amino acid 
substitutions are those that preserve the general charge, 

hydrophobicity/hydrophiHcity, and/or steric bulk of the amino acid substituted. 
For example, substitutions between the following groups are conservative: 
Gly/Ala, Val/Ile/Leu, Asp/Glu, Lys/Arg, Asn/Gln, Ser/Cys,Thr, and 
Phe/Trp/Tyr. The genetic code can be used to select the appropriate codons to 
construct the corresponding ysiriants. 

Small molecule inhibitors 

Small molecule inhibitors are usually chemical entities that can be 
obtained by screening of already existing Hbraries of compoimds or by 
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designing compounds based on the structure of the protein encoded by a gene 
involved in tumor development. Briefly, the structure of at least a fragment of 
the protein is determined by either Nuclear Magnetic Resonance or X-ray 
crystallography. Based on this structure, a virtual screening of compounds is 

5 performed. The selected compounds are synthesized using medicinal and/or 
combinatorial chemistry and thereafter analyzed for their inhibitory effect on 
the protein in vitro and in vivo. This step can be repeated until a compound is 
selected with the desired inhibitory effect. After optimization of the compound, 
its toxicity profile and efficacy as cancer therapeutic is tested in vivo using 

0 appropriate animal model systems. 

Differentially expressed genes that do not encode membrane-bound 
proteins are selected as targets for the development of small molecule 
inhibitors. To identify putative binding sites or pockets for small molecules on 
the surface of the target proteins, the three-dimensional structxire of those 

5 targets are determined by standard crystallization techniques (de Vos et ah 
1988, Science 239:888-93; Williams et al. 2001, Nat Struct Biol 8:838-42), 
Additional mutational analysis may be performed to confirm the functional 
importance of the identified binding sites. Subsequently, Cerius2 (Molecular 
Simulations Inc., San Diego, CA, USA) and Ludi/ACD (Accehrys Inc., San 

0 Diego, CA, USA) software is used for virtual screening of small molecule 
libraries (Bohm. 1992 J Comp Aided Molec Design 6:61-78). The compounds 
identified as potential binders by these programs are S3mthesized by 
combinatorial chemistry and screened for binding afEinity to the targets as well 
as for their inhibitory capacities of the target protein's function by standard in 

5 vitro and in vivo assays. In addition to the rational development of novel small 
molecules, existing small molecule compound libraries are screened using 
these assays to generate lead compounds. Lead compounds identified are 
subsequently co-crystallized with the target to obtain information on how the 
binding of the small molecule can be improved (Zeslawska et aL 2000, J Mol 

0 Biol 301:465-75). Based on these findings, novel compoimds are designed. 
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synthesized, tested, and co-crystallized. This optimization process is repeated 
for several rounds leading to the development of a high-£iffinity compound of 
the invention that successfully inhibits the function of its target protein. 
Finally, the toxicity of the compound is tested using standard assays 
5 (commercially available service via MDS Pharma Services, Montreal, Quebec, 
Canada) after which it is screened in an animal model system. 

Ribozymes 

Trans-cleaving catalytic RNAs (ribozymes) are RNA molecules 
10 possessing endoribonuclease activity. Ribozymes are specifically designed for a 
particular target, and the target message must contain a specific nucleotide 
sequence. They are engineered to cleave any RNA species site-specifically in 
the background of cellular RNA. The cleavage event renders the mRNA 
imstable and prevents protein expression. Importantly, riboz3nne8 can be used 
15 to inhibit expression of a gene of unknown function for the purpose of 

determining its function in an in vitro or in vivo context, by detecting the 
phenotypic efiGect. 

One commonly used ribozyme motif is the hammerhead, for which the 
substrate sequence requirements are minimal. Design of the hammerhead 

20 ribozyme is disclosed in Usman et al., Cmrrent Opin. Struct. Biol. (1996) 6:527- 
533. Usman also discusses the therapeutic uses of ribozsnnes. Ribozymes can 
also be prepared and used as described in Long et al., FASEB J. (1993) 7:25; 
Symons, Ann. Rev. Biochem. (1992) 61:641; Perrotta et al., Biochem. (1992) 
31:16-17; Ojwang et al., Proc. Natl. Acad. Sci, (USA) (1992) 89:10802-10806; 

25 and U.S. Pat. No. 5,254,678. Ribozyme cleavage of HIV-I RNA is described in 
U.S. Pat. No. 5,144,019; methods of cleaving RNA using ribozymes is described 
in U.S. Pat. No. 5,116,742; and methods for increasing the specificity of 
ribozymes are described in U.S. Pat. No. 5,225,337 and Koiziuni et al.. Nucleic 
Acid Res. (1989) 17:7059-7071. Preparation and use of ribozyme firagments in a 

30 hammerhead structiire are also described by Koizumi et al.. Nucleic Acids Res. 
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(1989) 17:7059-7071. Preparation and use of riboz5ane fragments in a hairpin 
structure are described by Chowrira and Burke, Nucleic Acids Res. (1992) 
20:2835. Ribozymes can also be made by rolling transcription as described in 
Daubendiek and Kool, Nat. BiotecbnoL (1997) 15(3):273-277. 
5 The hybridizing region of the ribozyme may be modified or may be 

prepared as a branched structure as described in Horn and Urdea, Nucleic 
Acids Res. (1989) 17:6959-67. The basic structure of the ribozymes may also be 
chemically altered in ways familiar to those skilled in the art, and chemically 
synthesized ribozymes can be administered as synthetic oligonucleotide 

10 derivatives modified by monomeric units. In a therapeutic context, Uposome 
mediated delivery of ribozymes improves cellular uptake, as described in 
Birikh et al., Eur. J. Biochem. (1997) 245:1-16. 

Therapeutic and functional genomic apphcations of ribozymes proceed 
beginning with knowledge of a portion of the coding sequence of the gene to be 

15 inhibited. Thus, for many genes, a nucleic acid sequence provides adequate 
sequence for constructing an effective ribozyme. A target cleavage site is 
selected in the target sequence, and a riboz5rme is constructed based on the 5' 
and 3' nucleotide sequences that flank the cleavage site. Retroviral vectors are 
engineered to express monomeric and mtdtimeric hammerhead ribozymes 

20 targeting the mRNA of the target coding sequence. These monomeric and 
multimeric ribozymes are tested in vitro for an ability to cleave the target 
mRNA. A cell line is stably transduced with the retroviral vectors expressing 
the ribozymes, and the transduction is confirmed by Northern blot analysis 
and reverse-transcription polymerase chain reaction (RT-PCR). The cells are 

25 screened for inactivation of the target mRNA by such indicators as reduction of 
expression of disease markers or reduction of the gene product of the target 
mRNA. 

Antisense 
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Antisense polynucleotides are designed to specifically bind to RNA, 
resulting in the formation of RNA-DNA or RNA-RNA hybrids, with an arrest 
of DNA replication, reverse transcription or messenger RNA translation. 
Antisense poljnaucleotides based on a selected sequence can interfere with 
5 expression of the corresponding gene. 

Antisense polynucleotides are typically generated within the cell by 
expression from antisense constructs that contain the antisense strand as the 
transcribed strand. Antisense polynucleotides wUl bind and/or interfere with 
the translation of the corresponding mRNA. As such, antisense may be used 

10 therapeutically the inhibit the expression of oncogenes. 

Antisense RNA or antisense ohgodeoxynucleotides (antisense ODNs) 
can both be used and may also be prepared in vitro synthetically or by means 
of recombinant DNA techniques. Both methods are well within the reach of the 
person skilled in the art, ODNs are smaller than complete antisense RNAs and 

15 have therefore the advantage that they can more easily enter the target cell. In 
order to avoid their digestion by DNAse, ODNs and antisense RNAs may be 
chemicadly modified. For targeting to the desired target cells, the molecides 
may be linked to Ugands of receptors found on the target cells or to antibodies 
directed against molecules on the surface of the target cells. 

20 Antisense therapy for a variety of cancers is in clinical phase and has 

been discussed extensively in the Uterature. Reed reviewed antisense therapy 
directed at the Bcl-2 gene in tumors; gene transfer-mediated overexpression of 
Bcl-2 in tumor cell lines conferred resistance to many types of cancer drugs, 
(Reed, J. C, N.C.L. (1997) 89:988-990). The potential for clinical development 

25 of antisense inhibitors of ras is discussed by Cowsert, L. M., Anti-Cancer Drug 
Design (1997) 12:359-371. Additional important antisense targets include 
leukemia (Geurtz, A. M., Anti-Cancer Drug Design (1997) 12:341-358); human 
C-ref kinase (Monia, B. P., Anti-Cancer Drug Design (1997) 12:327-339); and 
protein kinase C (McGraw et al., Anti-Cancer Drug Design (1997) 12:315-326. 
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Given the extensive background literatiu-e and clinical experience in 
antisense therapy, one skilled in the art can use selected nucleic acids of the 
invention as additional potential therapeutics. 



5 RNAi 

RNAi refers to the introduction of homologous double stranded RNA to 
specifically target the transcription product of a gene, resulting in a null or 
hypomorphic phenotype. RNA interference requires an initiation step and an 
effector step. In the first step, input double -stranded (ds) RNA is processed 

10 into nucleotide 'guide sequences*. These may be single- or double-stranded. The 
guide RNAs are incorporated into a nuclease complex, called the RNA-induced 
silencing complex (RISC), which acts in the second effector step to destroy 
mRNAs that are recognized by the guide RNAs through base-pairing 
interactions. RNAi molecules are thus double stranded RNAs (dsRNAs) that 

15 are very potent in sUencing the expression of the target gene. The invention 
provides dsRNAs complementary to the genes listed in Table 1. 

The ability of dsRNA to suppress the expression of a gene corresponding 
to its own sequence is also called post-transcriptional gene silencing or PTGS. 
The only RNA molecules normally foxmd in the cytoplasm of a cell are 

20 molecules of single -stranded mRNA. If the cell finds molecules of double- 
stranded RNA, dsRNA, it uses an enzyme to cut them into fragments 
containing in genersd 21-base pairs (about 2 turns of a double helix). The two 
strands of each firagment then separate enough to expose the antisense strand 
so that it can bind to the complementary sense sequence on a molecule of 

25 mRNA. This triggers cutting the mRNA in that region thus destro3dng its 

ability to be translated into a polypeptide. Introducing dsRNA corresponding 
to a particular gene will knock out the cell's endogenous expression of that 
gene. This can be done in particular tissues at a chosen time. A possible 
disadvantage of simply introducing dsRNA fi-agments into a cell is that gene 

30 expression is only temporarily reduced. However, a more permanent solution 
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is provided by introducing into the cells a DNA vector that can continuously 
synthesize a dsRNA corresponding to the gene to be suppressed. 

RNAi molecules are prepared by methods well known to the person 
skilled in the art. In general an isolated nucleic acid sequence comprising a 
5 nucleotide sequence which is substantially homologous to the sequence of at 
least one of the murine genomic regions Hsted in Table 1 and which is capable 
of forming one or more transcripts able to form a partially of fully double 
stranded (ds) RNA with (part of) the transcription product of said murine 
cancer genes will function as an RNAi molecule. The double stranded region 
10 may be in the order of between 10-250, preferably 10-100, more preferably 20- 
50 nucleotides in length. 

RNAi molecules are preferably expressed from recombinant vectors in 
transduced host cells, hematopoietic stem cells being very suitable thereto. 

15 Dominant Negative Mutations 

Dominant negative mutations are readily generated for corresponding 
proteins that are active as multrmers. A mutant polypeptide will interact with 
wild-type polypeptides (made from the other aUele) and form a non-functional 
multimer. Thus, a mutation is in a substrate-binding domain, a catalytic 

20 domain, or a cellular localization domain. Preferably, the mutant polypeptide 
wiQ be overproduced. Point mutations are made that have such an effect. In 
addition, fusion of different polypeptides of various lengths to the terminus of a 
protein can yield dominant negative mutants. General strategies are available 
for making dominant negative mutants. See Herskowitz, Nature (1987) 

25 329:219-222. Such a technique can be used for creating a loss of function 
mutation, which is useful for determining the function of a protein. 



30 



Detection probes and amplification primers 

Polynucleotide probes and primers comprising at least 8, preferably at 
least 10, more preferably at least 12 contiguous nucleotides selected from the 
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nucleotide sequence of a murine genomic region listed in Table 1 or selected 
from the nucleotide sequence of genes affected by transformations in said 
murine genomic regions or complements thereof are used for a variety of 
purposes, including identification of human chromosomes and deterroining 
5 transcription levels of these genes. . 

The nucleotide probes are labeled, for example, with a radioactive, 
fluorescent, biotinylated, or chemiluminescent label, and detected by well 
known methods appropriate for the particular label selected. Protocols for 
hybridizing nucleotide probes to preparations of metaphase chromosomes are 

10 also well known in the art. A nucleotide probe will hybridize specifically to 
nucleotide sequences in the chromosome preparations which are 
complementary to the nucleotide sequence of the probe. A probe that 
hybridizes specifically to a polynucleotide should provide a detection signal at 
least 5-, 10-, or 20-fold higher than the background hybridization provided 

15 with other unrelated sequences. 

Nucleotide probes are used to detect expression of a gene corresponding 
to the polynucleotide. For example, in Northern blots, mRNA is separated 
electrophoretically and contacted with a probe. A probe is detected as 
hybridizing to an mRNA species of a particular size. The amoimt of 

20 hybridization is quantitated to determine relative amounts of egression, for 
example under a particular condition. 

Probes are also used to detect products of amplification by poljonerase 
chain reaction. The products of the reaction are hybridized to the probe and 
hybrids are detected. Probes are used for in situ hybridization to cells to detect 

25 expression or to perform chromosome painting. Probes can also be used in vivo 
for diagnostic detection of hybridizing sequences. Probes are tj^icaUy labeled 
with a fluorescent or luminescent label. Other types of detectable labels may 
be used such as chromophores, radioactive isotopes, and enzymes. 

Expression of specific mRNA can vary in different cell types and can be 

30 tissue specific. This variation of mRNA levels in different cell types can be 
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exploited with nucleic acid probe assays to determine disease stages. For 
example, PGR, branched DNA probe assays, or blotting techniques utilizing 
nucleic acid probes substantially identical or complementary to nucleic acid 
sequences of murine genomic regions listed in Table 1 or of genes affected by 
5 transformations therein can determine the presence or absence of cDNA or 
mRNA related to the polynucleotides of the invention. 

Examples of a nucleotide hybridization assay are described in Urdea et 
al., PCT WO92/02526 and Urdea et aL, U.S. Pat. No. 5,124,246, both 
incorporated herein by reference. The references describe an example of a 

10 sandwich nucleotide hybridization assay. 

Alternatively, the Polymerase Chain Reaction (PGR) is another means 
for detecting small amounts of target nucleic acids, as described in MuUis et 
al., Meth. Enzymol. (1987) 155:335-350; U.S. Pat. No. 4,683,195; and U.S. Pat. 
No. 4,683,202, all incorporated herein by reference. Two primer 

15 polynucleotides nucleotides hybridize with the target nucleic acids and are 
used to prime the reaction. The primers may be composed of sequence within 
or 3' and 5* to the nucleic acid sequences of mxirine genomic regions listed in 
Table 1 or of genes affected by transformations therein. Alternatively, if the 
primers are 3' and 5' to these nucleic acid sequences, they need not hybridize 

20 to them or the complements. Yet, they may for instance also hybridize to 

sequences flanking the nucleic acid sequences of murine genomic regions listed 
in Table 1 or of genes affected by transformations therein. A thermostable 
polymerase creates copies of target nucleic acids from the primers using the 
original target nucleic acids as a template. After a large amount of target 

25 nucleic acids is generated by the poljonerase, it is detected by methods such as 
Southern blots. When using the Southern blot method, the labeled probe will 
hybridize to a nucleic acid sequence of a murine genomic region listed in Table 
1 or of a gene affected by a transformation therein or complement. 

Furthermore, mRNA or cDNA can be detected by traditional blotting 

30 techniques described in Sambrook et al., "Molecular Cloning: A Laboratory 
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Manual" (New York, Cold Spring Harbor Laboratory, 1989). mRNA or cDNA 
generated from mRNA using a polymerase enzyme can be purified and 
separated using gel electrophoresis. The nucleic acids on the gel are then 
blotted onto a solid support, such as nitrocellulose. The soHd support is 

5 exposed to a labeled probe and then washed to remove any unhybridized probe. 
Next, the duplexes containing the labeled probe are detected. Typically, the 
probe is labeled with radioactivity. 

Additionally, mRNA or cDNA can be detected by using probe arrays. 
mRNA is obtained for instance from tumor tissue and cDNA is generated 

0 therefrom by reverse PGR. The cDNA is optionally labeled by incorporation of 
fluorescently labeled nucleotide analogues therein and is hybridized to the 
probe array. Expression levels are determined from the intensity of the 
fluorescent signal obtained firom the corresponding or respective probe sites on 
the array. 



Use of Polynucleotides to Raise Antibodies 

The present invention provides in one aspect for antibodies suitable for 
therapeutic and/or diagnostic use. 

Therapeutic antibodies include antibodies that can bind specifically to 
the expression products of the genes (partly) encoded in the nucleic acid 
sequences of murine genomic regions listed in Table 1 or of genes affected by 
transformations in these regions. By binding directly to the gene products, the 
antibodies may influence the function of their targets by, for example, in the 
case of proteins, steric hindrance, or by blocking at least one of the functional 
domains of those proteins. As such, these antibodies may be used as inhibitors 
of the function of the gene product. Such antibodies may for instance be 
generated against functionally relevant domains of the proteins and 
subsequently screened for their ability to interfere with the target's function 
using standard techniques and assays (see for instance Schwartzberg, 2001, 
Crit Rev Oncol Hematol 40:17-24; Herbst et aL Cancer 94:1593-611, 2002). 
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Alternatively, anti-RNA antibodies may for instance be useful in 
silencing messengers of the timaor-related genes of the present invention. In 
another alternative, antibodies may elIso be used to influence the function of 
their targets indirectly, for instance by binding to members of signsding 

5 pathways in order to influence the function of the targeted proteins or nucleic 
acids. In yet another alternative, therapeutic antibodies may carry one or more 
toxic compounds that exert their effect on the target or target cell by virtue of 
the binding of the carrying antibody thereto. 

For diagnostic purposes, antibodies similar to those above, preferably 

0 those that are capable of binding to the expression products of the genes of the 
present invention may be used, and that are provided with detectable labels 
such as fluorescent, Itiminescent, or radio-isotope labels in order to allow the 
detection of the gene product. Preferably such diagnostic antibodies are 
targeted to proteinaceous targets present on the outer envelop of the cell, such 

5 as membrane bound target proteins. The use of antibodies for the diagnosis of 
specific types of cancer is known to the skilled person (Sjrrigos et aL 1999, 
Hybridoma 18:219-24). 

The antibodies used in the present invention may be from any animal 
origin including birds and mammals (e.g., human, murine, donkey, sheep, 

0 rabbit, goat, guinea pig, camel, horse, or chicken). Preferably, the antibodies of 
the invention are human or humanized monoclonal antibodies. As used herein, 
'Tiuman" antibodies include antibodies having the amino acid sequence of a 
human immunoglobulin and include antibodies isolated from human 
immunoglobulin libraries (including, but not limited to, synthetic libraries of 

5 immunoglobulin sequences homologous to human immxmoglobulin sequences) 
or from mice that express antibodies from human genes. 

For some \ises, including in vivo therapeutic or diagnostic use of 
antibodies in humans and in vitro detection assays, it may be preferred to use 
human or chimeric antibodies. Completely human antibodies are particularly 

0 desirable for therapeutic treatment of human subjects. Human antibodies can 
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be made by a variety of methods known in the art including phage display 
methods described above using antibody HbrsLries derived from human 
immtmoglobuhn sequences or synthetic sequences homologous to human 
immunoglobuhn sequences. See also U.S. Patent Nos. 4,444,887 and 4,716,111; 
5 and PCT pubUcations WO 98/46645, WO 98/50433, WO 98/24893 and 

W098/16654, each of which is incorporated herein by reference in its entirety. 

The antibodies to be used with the methods of the invention include 
derivatives that are modified, i.e, by the covalent attachment of any type of 
molecule to the antibody such that covalent attachment. Additionally, the 

10 derivative may contain one or more non-classical amino acids. 

In certain embodiments of the invention, the antibodies to be used with 
the invention have extended half-Uves in a mammal, preferably a human, 
when compared to xmmodified antibodies. Antibodies or antigen-binding 
fragments thereof having increased in vivo half-lives can be generated by 

15 techniques known to those of skill in the art (see, e.g,, PCT Pubhcation No. WO 
97/34631). 

In certain embodiments, antibodies to be used with the methods of the 
invention are single-chain antibodies. The design and construction of a single- 
chain antibody is described in Marasco et al, 1993, Proc Natl Acad Sci 90:7889- 

20 7893, which is incorporated herein by reference in its entirety. 

In certain embodiments, the antibodies to be used with the invention 
bind to an intracellular epitope, £.e., are intrabodies. An intrabody comprises at 
least a portion of an antibody that is capable of immunospecifLcally binding an 
antigen and preferably does not contain sequences coding for its secretion. 

25 Such antibodies will bind its antigen intracellularly. In one embodiment, the 
intrabody comprises a single -chain Fv C'sPv"). For a review of sFv see 
Pluckthun in The Pharmacology of Monoclonal Antibodies^ voL 113, Rosenbxirg 
and Moore eds. Springer- Verlag, New York, pp. 269-315 (1994). In a further 
embodiment, the intrabody preferably does not encode an operable secretory 

30 sequence and thus remains within the cell (see generally Marasco, WA, 1998, 
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"Intrabodies: Basic Research and Clinical Gene Therapy Applications" 
Springer: New York). 

Generation of intrabodies is well-known to the skilled artisan and is 
described for example in U.S. Patent Nos. 6,004,940; 6,072,036; 5,965,371, 
which are incorporated by reference in their entireties herein. 

In one embodiment, intrabodies are expressed in the cytoplasm. In other 
embodiments, the intrabodies are localized to various intracellular locations. 
In such embodiments, specific localization sequences can be attached to the 
intranucleotide polypepetide to direct the intrabody to a specific location. 

The antibodies to be used with the methods of the invention or 
fragments thereof can be produced by any method known in the art for the 
synthesis of antibodies, in particular, by chemical synthesis or preferably, by 
recombinant expression techniques. 

Monoclonal antibodies can be prepared using a wide variety of 
techniques known in the art including the use of hybridoma, recombinant, and 
phage display technologies, or a combination thereof. For example, monoclonal 
antibodies can be produced using hybridoma techniques including those known 
in the art and taught, for example, in Harlow et aZ., Antibodies: A Laboratory 
Manualy (Cold Spring Harbor Laboratory Press, 2nd ed. 1988); Hammerling, et 
ah, in: Monoclonal Antibodies and T-Cell Hybridomas 563-681 (Elsevier, N.Y., 
1981) (said references incorporated by reference in their entireties). The term 
"monoclonal antibody^' as used herein is not limited to antibodies produced 
through hybridoma technology. The term "monoclonal antibody" refers to an 
antibody that is derived from a single clone, including any eukaryotic, 
prokaryotic, or phage clone, and not the method by which it is produced. 

Examples of phage display methods that can be used to make the 
antibodies of the present invention include those disclosed in W097/13844; and 
U.S. Patent Nos. 5,580,717, 5,821,047, 5,571,698, 5,780,225, and 5,969,108; 
each of which is incorporated herein by reference in its entirety. 
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As described in the above references, after phage selection, the antibody 
coding regions from the phage can be isolated and used to generate whole 
antibodies, including human antibodies, or any other desired antigen binding 
fragment, and expressed ia any desired host, including maromalian cells, 
5 insect cells, plant cells, yeast, and bacteria, e.g. , as described below. 

Techniques to recombinantly produce Fab, Fab' and F(ab')2 fragments can also 
be employed using methods known in the art such as those disclosed in PCT 
publication No. WO 92/22324; MuUinax et al., 1992, BioTechniques 12(6):864- 
869 and Better et al, 1988, Science 240:1041-1043 (said references 

10 incorporated by reference in their entireties). 

It is also possible to produce therapeutically useful IgG, IgA, IgM and 
IgE antibodies. For an overview of the technology for producing human 
antibodies, see Lonberg and Huszar (1995, Int. Rev. ImmunoL 13:65-93). For a 
detailed discussion of the technology for produciag human antibodies and 

15 human monoclongd antibodies and protocols for producing such antibodies, see, 
e,g., PCT publication No. WO 98/24893, which is incorporated by reference 
herein in its entirety. In addition, companies such as Medarex, Inc. (Princeton, 
NJ), Abgenix, Inc. (Freemont, CA) and Genpharm (San Jose, CA) can be 
engaged to provide human antibodies directed against a selected antigen using 

20 technology similar to that described above. 

Recombinant expression used to produce the antibodies, derivatives or 
analogs thereof (e.g-., a heavy or light chain of an antibody of the invention or a 
portion thereof or a single chsdn antibody of the invention), requires 
construction of an expression vector containing a polynucleotide that encodes 

25 the antibody and the expression of said vector in a suitable host cell or even in 
vivo.Once a polynucl eotide encoding an antibody molecule or a heavy or light 
chain of an antibody, or portion thereof (preferably, but not necessarily, 
containing the heavy or Ught chain variable domain), of the invention has been 
obtained, the vector for the production of the antibody molecule may be 

30 produced by recombinant DNA technology using techniques well known in the 
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art. Thus, methods for preparing a protein by expressing a polynucleotide 
containing an antibody encoding nucleotide sequence are described herein. 
Methods which are well known to those skilled in the art can be used to 
construct expression vectors containing antibody coding sequences and 
5 appropriate transcriptional and translational control signals. These methods 
include, for example, in vitro recombinant DNA techniques, synthetic 
techniques, and in vivo genetic recombination. The invention, thus, provides 
repUcable vectors comprising a nucleotide sequence encoding an antibody 
molecule of the invention, a heavy or Ught chain of an antibody, a heavy or 

10 light chain variable domain of an antibody or a portion thereof, or a heavy or 
light chain CDR, operably linked to a promoter. Such vectors may include the 
nucleotide sequence encoding the constant region of the antibody molecule 
(see, e.g., PCT PubHcation WO 86/05807; PCT Pubhcation WO 89/01036; and 
U.S. Patent No. 5,122,464) gind the variable domain of the antibody may be 

.15 cloned into such a vector for expression of the entire heavy, the entire Ught 
chain, or both the entire heavy and hght chains. 

The expression vector is transferred to a host cell by conventional 
techniques and the transfected cells are then cultured by conventional 
techniques to produce an antibody of the invention. Thus, the invention 

20 includes host cells containing a polynucleotide encoding ain antibody of the 
invention or fragments thereof, or a heavy or light chain thereof, or portion 
thereof, or a single chain antibody of the invention, operably linked to a 
heterologous promoter. In preferred embodiments for the expression of double- 
chained antibodies, vectors encoding both the heavy and light chains may be 

25 co-expressed in the host ceU for expression of the entire immunoglobulin 
molecule, as detailed below. 

A variety of host-expression vector systems may be utilized to' express 
the antibody molecules of the invention (see above) 

In mammalian host cells, a number of viral-based expression systems 

30 may be utilized. In cases where an adenovirus is used as an expression vector. 
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the antibody coding sequence of interest may be ligated to an adenovirus 
transcription/translation control complex, e.g., the late promoter and tripartite 
leader sequence. This chimeric gene may then be inserted in the adenovirus 
genome by in vitro or in vivo recombination. Insertion ia a non-essential region 
5 of the viral genome ie,g., region El or E3) will result in a recombinant virus 

that is viable and capable of expressing the antibody molecule in infected hosts 
(e.g., see Logan & Shenk, 1984, Proc. Natl. Acad. Sci. USA 81:355-359). 
Specific initiation signals may also be required for efficient translation of 
inserted antibody coding sequences. These signals include the ATG initiation 

10 codon and adjacent sequences. Furthermore, the initiation codon must be in 
phase with the reading frame of the desired coding sequence to ensure 
translation of the entire insert. These exogenous translational control signals 
and initiation codons can be of a variety of origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclusion of appropriate 

15 transcription enhancer elements, transcription terminators, etc. (see, e.^., 
Bittner et aL, 1987, Methods in EnzymoL 153:516-544). 

Once an antibody molecide to be used with the methods of the invention 
has been produced by recombinant expression, it may be purified by any 
method known in the art for purification of an immxmoglobulin molecule, for 

20 example, by chromatography (e.g., ion exchange, affinity, particularly by 
affinity for the specific antigen after Protein A, and sizing column 
chromatography), centrifugation, differential solubility, or by any other 
standard technique for the purification of proteins. Further, the antibodies of 
the present invention or fragments thereof may be fused to heterologous 

25 polypeptide sequences described herein or otherwise known in the art to 
facUitate purification. 

As stated above, according to a further aspect, the invention provides an 
antibody as defined above for use in therapy. 

For therapeutic treatment, antibodies may be produced in vitro and 

30 applied to the subject in need thereof. The antibodies may be administered to a 
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subject by any suitable route, preferably in the form of a pharmaceutical 
composition adapted to such a route and in a dosage which is effective for the 
intended treatment. Therapeutically effective dosages of the antibodies 
required for decreasing the rate of progress of the disease or for eliminating 
the disease condition can easily be determined by the skilled person. 

Alternatively, antibodies may be produced by the subject itself by using 
in vivo antibody production methodologies as described above. Suitably, the 
vector used for such in vivo production is a viral vector, preferably a viral 
vector with a target cell selectivity for specific cancer cells. 

Therefore, accorduig to a still further aspect, the invention provides the 
use of an antibody as defined above in the manufacture of a medicament for 
use in the treatment of a mammal to achieve the said therapeutic effect. The 
treatment comprises the administration of the medicament in a dose sufficient 
to achieve the desired therapeutic effect. The treatment may comprise the 
repeated administration of the antibody. 

According to a still further aspect, the invention provides a method of 
treatment of a hiunan comprising the administration of an antibody as defined 
above in a dose sufficient to achieve the desired therapeutic effect. The 
therapeutic effect being the alleviation or prevention of various types of 
cancers such as bone cancers, brain tumors, breast cancer, endocrine system 
cancers, gastrointestinal cancers, gynaecologic cancers, head and neck cancers, 
leukemia, lung cancers, lymphomas, metastases, myelomas, pediatric cancers, 
penile cancer, prostate cancer, sarcomas, skin cancers, testicular cancer, 
thyroid cancer, xirinary tract cancers, more in particular the remission or 
prevention of relapse of solid tumors of bladder, cervix, lung, colon, breast, 
prostate, ovary, pancreas, Uver, testicle, uterus, bone, oral cavity and 
oropharynx tissue, preferable of leukemia, such as acute myeloid leukemia 
(AML), acute lymphocj^ic leukemia (ALL), chronic myelogenous leiikemia 
(CML) and chronic lymphocytic leukemia (CLL), most preferably of AML. 
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The diagnostic and therapeutic antibodies are preferably used in their 
respective application for the targeting of kinases or phosphatases, which are 
often coupled to receptor molecules on the cellos surface. As such, antibodies 
cap able of binding to these receptor molecules can exert their activity- 
5 modulating effect on the kinases or phosphatases by binding to the respective 
receptors. Also transporter proteins may be targeted with advantage for the 
same reason that the antibodies will be able to exert their activity-modulating 
effect when present extracellularly. The above targets, together with signaling 
molecules, represent preferred targets for the antibody uses of the invention as 

10 more effective therapy and easier diagnosis is possibly thereby. 

The diagnostic antibodies can suitably be used for the qualitative and 
quantitative detection of gene products, preferably proteins in assays for the 
determination of altered levels of proteins or structural changes therein. 
Protein levels may for instance be determined in cells, in cell extracts, in 

15 supematants, body fluids by for instance flow-cjrtometric evaluation of 

immimostained cancer cells. Alternatively, quantitative protein assays such as 
ELISA or RIA, Western blotting, and imaging technology (e.g., using confocal 
laser scanning microscopy) may be used in concert with the antibodies as 
described herein for the diagnosis of cancers, preferably leukemia. 

20 

Use of Polynucleotides to Construct Arrays for Diagnostics 

A polynucleotide capable of hybridising to a murine genomic region of 
the present invention as weU as sequences and gene products thereof is useful 
for determining the occurrence of cancer, tximor progression, hyperprohferative 

25 growth, and/or accompanjdng biological or physical manifestations. 

Specifically, the polynucleotides representing the murine genomic regions and 
genes affected by transformations therein and encoded pols^eptides can be 
utilized to determine the occurrence of leukemia, such as AML. 
To determine the occurrence of cancer, tumor progression, 

30 hyperproliferative growth, and/or accompanying biological or physical 
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manifestations, the levels of polynucleotides and/or encoded polypeptides of the 
present invention in a sample are compared to the levels in a normal control of 
body tissues, cells, organs, or fluids. The normal control can include a pool of 
cells from a particular organ or tissue or tissues and/or cells from throughout 
5 the body. For such measurements either immunoassays or nucleic acid assays 
as known in the art can be used. 

Any observed difference between the sample and normal control can 
indicate the occurrence of disease or disorder. Typically, if the levels of the 
polynucleotides and the encoded polypeptides of the present invention are 

10 higher than those found in the normal control, the results indicate the 

occurrence of cancer, tumor progression, hyperproUferative growth, and/or 
accompanjdng biological or physical manifestations. 

In addition, the present murine genomic regions can be useful to 
diagnose the severity as weU as the occurrence of cancer, tumor progression, 

15 hyperprohferative growth, and/or accompanying biological or physical 

manifestations. For example, the greater the difference observed in the sample 
versus the normal control of the expression products of the genes comprised in 
said genomic regions or affected by transformations therein, the greater the 
severity of the disorder, in particular, when higher levels as compared to a 

20 normal control are observed. 

Polynucleotide or oUgonucleotide arrays provide a high throughput 
technique that can assay a large number of polsmucleotide sequences in a 
sample. This technology can be used as a diagnostic and as a tool to test for 
differential expression to determine function of an encoded protein. 

25 Polynucleotide or oligonucleotide arrays constitute a specific embodiment of a 
diagnostic composition according to the present invention. 

To create arrays, polynucleotide or ohgonucleotide probes are spotted 
onto a substrate in a two-dimensional matrix or array. Samples of 
polynucleotides can be labeled and then hybridized to the probes. Double 

30 stranded polynucleotides, comprising the labeled sample polynucleotides 
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bound to probe polynucleotides or oligonucleotide, can be detected once the 
unboiind portion of the sample is washed away. 

The probe polynucleotides or ohgonucleotide can be spotted on 
substrates including glass, nitrocellulose, etc. The probes can be bound to the 
5 substrate by either covalent bonds or by non-specific interactions, such as 
hydrophobic interactions. The sample polynucleotides can be labeled using 
radioactive labels, fluorophors, etc. 

Techniques for constructing arrays and methods of using these arrays 
are described in EP No. 0 799 897; PCT No. WO 97/29212; PCT No. WO 
10 97/27317; EP No. 0 785 280; PCT No. WO 97/02357; U.S. Pat. No. 5,593,839; 
U.S. Pat. No. 5,578,832; EP No. 0 728 520; U.S. Pat. No. 5,599,695; EP No. 0 
721 016; U.S. Pat. No. 5,556,752; PCT No. WO 95/22058; and U.S. Pat. No. 
5,631,734. 

Further, arrays can be used to examine differential expression of genes 
15 and cam be used to determine gene function. For example, arrays of the instant 
polynucleotide sequences of the murine genomic regions and of genes affected 
by transformations therein can be \xsed to determine if any of the sequences 
are differentially esqpressed between normal cells and cancer cells, for example. 
High expression of a partLctilar message in a cancer cell, which is not observed 
20 in a corresponding normal cell, can indicate a cancer specific protein. 

Differential Expression 

The present invention also provides a method to identrfy abnormal or 
diseased tissue in a hxmian. The expression of a gene corresponding to a 

25 specific polynucleotide is compared between a first tissue that is suspected of 
being diseased and a second, normal tissue of the human. The normal tissue is 
any tissue of the human, especially those that express the murine genomic 
region-related gene including, but not Umited to, blood ceUs> brain, thymus, 
testis, heart, prostate, placenta, spleen, small intestine, skeletal muscle, 

30 pancreas, and the mucosal lining of the colon. 
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The expression products of murine genomic region-related genes in the 
two tissues are compared by any means known in the art. 

Murine genomic region -related mRNA in the two tissues is compared. 
PolyA RNA is isolated from the two tissues as is known in the art. For 
5 example, one of skill in the art can readily determine differences in the size or 
amount of polynucleotide-related mRNA transcripts between the two tissues 
using Northern blots and nucleotide probes. Increased or decreased expression 
of an polynucleotide-related mRNA in a tissue sample suspected of being 
diseased, compared with the expression of the same polynucleotide-related 
10 mRNA in a normal tissue, indicates manifestation of the disease. 

Screening for Peptide Analogs and Antagonists 

Polypeptide expression products of murine genomic region-related genes 
as well as corresponding full length genes can be used to screen peptide 
15 Ubraries to identify binding partners, such as receptors, from among the 
encoded polj^eptides. 

Such binding partners can be useful in treating cancer, t\mxor 
progression, hyperprohferative cell growth, and/or accompanying biological or 
physical manifestations. For example, peptides or other compotmds that are 
20 capable of binding or interacting with membrane-bound polypeptides encoded 
by one or more of the murine genomic regions listed in Table 1, can be useful 
as a therapeutic. Also, peptides or other compounds capable of altering the 
. conformation of any of the encoded polypeptides by one or more of the murine 
genomic regions listed in Table 1 can inhibit biological activity and be useful 
25 as a therapeutic. 

A Hbrary of peptides may be synthesized following the methods 
disclosed in U.S. Pat. No. 5,010.175, and in PCT W091/17823. 

Peptide agordsts or antagonists are screened using any available 
method, such as signal transduction, antibody binding, receptor binding, 
30 mitogenic assays, chemotaxis assays, etc. The methods described herein are 
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presently preferred. The assay conditions ideally should resemble the 
conditions under which the native activity is exhibited in vivo, that is, under 
physiologic pH, temperature, and ionic strength. Suitable agonists or 
antagonists will exhibit strong inhibition or enhancement of the native activity 
5 at concentrations that do not cause toxic side effects in the subject. Agonists or 
antagonists that compete for binding to the native polypeptide may require 
concentrations equal to or greater than the native concentration, while 
inhibitors capable of binding irreversibly to the polypeptide may be added in 
concentrations on the order of the native concentration. 

10 The end results of such screening and experimentation will be at least 

one novel polypeptide binding partner, such as a receptor, encoded by a murine 
genomic region-related gene of the invention, and at least one peptide agonist 
or antagonist of the novel binding partner. Such agonists and antagonists can 
be used to modulate receptor function in cells to which the receptor is native, 

15 or in cells that possess the receptor as a resxilt of genetic engineering. Further, 
if the novel receptor shares biologically important characteristics with a known 
receptor, information about agonist/antagonist binding may help in developing 
improved agonists/antagonists of the known receptor. 

Therapeutics, whether polynucleotide or polypeptide or small molecide, 

20 can be tested, for example, in the mouse tumor assay described rn Pei et al., 
MoL Endo. 11: 433-441 (1997). 

Other models for testing polynucleotides, polypeptides, antibodies, or 
small molecules useful for treatment include: animal models and cell lines 
disclosed in Bosland, Encyclopedia of Cancer, Volume II, pages 1283 to 1296 

25 (1997) by Academic Press. Other useful cell hnes are described in Brothman, 
Encyclopedia of Cancer, Volume 11, pages 1303 to 1313 (1997) by Academic 
Press 

Pharmaceutical Compositions and Therapeutic Uses 
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Pharmaceutical compositions can comprise polypeptides, antibodies, 
polynucleotides (antisense, RNAi, ribozyme), or small molecules of the claimed 
invention, collectively called inhibitor compounds herein. The pharmaceutical 
compositions will comprise a therapeutically effective amount of either . 
5 polypeptides, antibodies, polynucleotides or small molecules of the claimed 
invention. 

The term "therapeutically effective amount" as used herein refers to an 
amount of a therapeutic agent to treat, ameUorate, or prevent a desired 
disease or condition, or to exhibit a detectable therapeutic or preventative 

0 effect. The effect can be detected by, for example, chemical markers or antigen 
levels. Therapeutic effects also include reduction in physical sjnmptoms, such 
as decreased body temperature. The precise effective amount for a subject will 
depend upon the subject's size and health, the nature and extent of the 
condition, and the therapeutics or combination of therapeutics selected for 

5 administration. Thus, it is not useful to specify an exact effective amount in 
advance. However, the effective amount for a given situation can be 
determined by routine experimentation and is within the judgment of the 
clinician. Specifically, the compositions of the present invention can be used to 
treat, ameliorate, or prevent cancer, tumor progression, hyperproliferative cell 

0 growth and/or accompanjdng biological or physical manifestations, including 
leukaemia. 

For purposes of the present invention, an effective dose will be from 
about 0.01 mg/ kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg of the 
polynucleotide, polypeptide or antibody compositions in the individual to which 
5 it is administered. 

A pharmaceutical composition can also contain a pharmaceutically 
acceptable carrier. The term "pharmaceutically acceptable carrier" refers to a 
carrier for administration of a therapeutic agent, such as antibodies or a 
polypeptide, genes, and other therapeutic agents. The term refers to ajay 
D pharmaceutical carrier that does not itself induce the production of antibodies 
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harmful to the individual receiving the composition, and which may be 
adroinistered without undue toxicity. Suitable carriers may be large, slowly 
metabolized macromolecules such as proteins, polysaccharides, polylactic 
acids, polyglycolic acids, polymeric amino acids, amino acid copolymers, and 
5 inactive virus particles. Such carriers are well known to those of ordinary skill 
in the art. 

Pharmaceutically acceptable salts can be used therein, for example, 
mineral acid salts such as hydrochlorides, hydrobromides, phosphates, 
sulfates, and the like; and the sgdts of organic acids such as acetates, 

10 propionates, malonates, benzoates, and the hke. A thorough discussion of 
pharmaceutically acceptable excipients is available in Remington's 
Pharmaceutical Sciences (Mack Pub. Co., N.J. 1991). 

Pharmaceutically acceptable carriers in therapeutic compositions may 
contain hquids such as water, saline, glycerol and ethanol. Additionally, 

15 auxiliary substances, such as wetting or emulsifying agents, pH bu£fering 
substances, and the like, may be present in such vehicles. Typically, the 
therapeutic compositions are prepared as injectables, either as hquid solutions 
or suspensions; solid forms suitable for solution in, or suspension in, liquid 
vehicles prior to injection may also be prepared. Liposomes are included within 

20 the dejBnition of a pharmaceutically acceptable carrier. 

Delivery Methods 

Once formulated, the pharmaceutical compositions of the invention can 
be (1) administered directly to the subject; (2) delivered ex vivo, to cells derived 
25 from the subject; or (3) deUvered in vitro for expression of recombinant 
proteins. 

Direct delivery of the compositions will generally be accomplished by 
injection, either subcutaneously, intraperitoneally, intravenously or 
intramuscularly, or delivered to the interstitial space of a tissue. The 
30 compositions can also be administered into a tumor or lesion. Other modes of 



wo 2004/016317 



PCT/NL2003/000583 



55 

administration include topical, oral, catheterized and pulmonary 
administration, suppositories, and transdermal applications, needles, and 
particle gimsor hyposprays. Dosage treatment may be a single dose schedvile 
or a multiple dose schedule. 
5 Methods for the ex vivo delivery and reimplantation of transformed cells 

into a subject are known in the art and described in e.g.. International 
Publication No. WO 93/14778. Examples of cells useful in ex vivo applications 
include, for example, stem cells, particularly hematopoetic, lymph cells, 
macrophages, dendritic cells, or tumor cells. 

10 Generally, dehvery of nucleic acids for both ex vivo and in vitro 

applications can be accompUshed by, for example, dextran-mediated 
transfection, calcium phosphate precipitation, polybrene mediated 
transfection, protoplast fusion, electroporation, encapsvilation of the 
polynucleotide (s) in hposomes, and direct microinjection of the DNA into 

15 nuclei, aU well known in the art. 

Various methods are used to administer the therapeutic composition 
directly to a specific site in the body. For example, a small metastatic lesion is 
located and the therapeutic composition injected several times in several 
different locations within the body of tumor. Alternatively, arteries which 

20 serve a txmior are identified, and the therapeutic composition injected into 
such an artery, in order to dehver the composition directly into the tumor. A 
tumor that has a necrotic center is aspirated and the composition injected 
directly into the now empty center of the tumor. The antisense composition is 
directly administered to the surface of the tumor, for example, by topical 

25 application of the composition. X-ray imaging is used to assist in certain of the 
above dehvery methods. 

Receptor-mediated targeted dehvery of therapeutic compositions 
containing an antisense polsoiudeotide, sub genomic pol3aiucleotides, or 
antibodies to specific tissues is also used. Receptor-mediated DNA dehvery 

30 techniques are described in, for example, Findeis et al., Trends in Biotechnol. 
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(1993) 11:202-205; Chiou et al., (1994) Gene Therapeutics: Methods And 
Applications Of Direct Gene Transfer (J. A. Wolff, ed.); Wu et al., J. Biol. 
Chem. (1994) 269:542-46. Preferably, receptor-mediated targeted delivery of 
therapeutic compositions containing antibodies of the invention is used to 
5 deliver the antibodies to specific tissue. 

Pharmaceutical compositions containing antisense, ribozyme or RNAi 
polynucleotides are administered in a range of about 100 ng to about 200 mg of 
polynucleotides for local administration in a gene therapy protocol. 
Concentration ranges of about 500 ng to about 50 mg, about 1 fig to about 2 

10 mg, about 5 |ig to about 500 jig, and about 20 \ig to about 100 p.g of 

polynucleotides can also be used during a gene therapy protocol. Factors such 
as method of action and efficacy of transformation and expression are 
considerations which will affect the dosage required for ultimate efficacy of the 
polynucleotides. Where greater expression is desired over a larger area of 

15 tissue, larger amounts of polynucleotides or the same amounts readministered 
in a successive protocol of administrations, or several administrations to 
different adjacent or close tissue portions of, for example, a timior site, may be 
reqtiired to effect a positive therapeutic outcome. In all cases, routine 
experimentation in clinical trials will determine specific ranges for optimal 

20 therapeutic effect. A more complete description of gene therapy vectors, 

especially retroviral vectors, is contained in U,S. Ser. No. 08/869,309, which is 
expressly incorporated herein. 

Development of specific therapeutic methods 

25 As illustrated by the case wherein a specific translocation, e.g. the 

t(15,17) translocation as described above, can be assigned to a specific 
mutation and to a dysfunctional gene, the identity and more specifically the 
functionality of that gene may aid significantly in the development of methods 
of treatment. The functionality of a gene may be determined by methods 

30 known in the art, for instance by structure-function analysis. 
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Genes that are shown to be both dififerentiaJly expressed and 
functionally important for human tumors are selected for structure-function 
analysis. Deletion and point mutation mutants of these genes are constructed 
and tested for their functional competence compared with wild-tjrpe genes 
5 (according to Ibanez et al. Structural and functional domains of the myb 
oncogene: requirements for nuclear transport, myeloid transformation, and 
colony formation. J Virol 62:1981-8, 1988; Rebay et aL Specific truncations of 
Drosophila Notch define dominant activated and dominant negative forms of 
the receptor. CeU 74:319-29, 1993). These studies lead to the identification of 

10 functionally critical domains as well as to dominant-negative variants, i.e. 

mutant genes that suppress the function of their endogenous counterpsurts (e.g. 
Kashles et al. A dominant negative mutation suppresses the function of normal 
epidermal growth factor receptors by heterodimerization. Mol Cell Biol 
11:1454-63, 1991). These dominant-negatives provide additional proof for the 

15 functional importance of the genes and give insight into which therapeutic 
• approaches can be pursued to interfere with their function or that can support 
their impaired function. 

Development of an inhibitor compound 

20 The present invention also provides methods for the development and 
production of inhibitor compounds capable of inhibiting the genes or 
expression products disclosed in the present invention that play a role in the 
development of cancer. Essentially, such methods are weU known in the art 
and generally comprise a number of consecutive steps starting with the 

25 identification of genes involved in cancer, in particular as disclosed in 

Examples 2 and 3 below, by using retroviral insertional tagging, optionally in 
a specific genetic background; 

b) validation of one or more of the identified genes as potential target gene(s) 
for the inhibitor compound by one or more of the following methods: 
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- confirmation of the identified gene by Northern Blot analysis in cancer cell- 
lines; 

- determination of the expression profile of the identified gene in tumors and 
normal tissue; 

5 - determination of the functional importance of the identified genes for cancer; 
c) production of the expression product of the gene; and 

d) use of the expression product of the gene for the production or design 
of an inhibitor compound. 

0 Functional importance of the identified genes for human cancer 

The functional importance of the differentially expressed genes for 
tumor cells is determined by over-expression as weD as by inhibition of the 
expression of these genes. Selected human tumor cell lines are transfected 
either with plasmids encoding cDNA of the genes or with plasmids encoding 

5 RNA interference probes for the genes. RNA interference is a recently 
developed technique that involves introduction of double-stranded 
ohgonucleotides designed to block expression of a specific gene (see Elbashir et 
al Nature 411:494-8, 2001; Brummelkamp et aL Science 296:550-3, 2002). 
Using standard laboratory techniques and assays, the transfected cell lines are 

0 extensively checked for altered phenotypes that are relevant for the tumor 

cells, e.g. cell cycle status, proliferation, adhesion, apoptosis, invasive abilities, 
etc. These e3cperiments demonstrate the functional importance of the identified 
genes for human cancer. 



Gene Therapy 

The therapeutic polynucleotides and polypeptides of the present 
invention may be utilized in gene deUvery vehicles. The gene delivery vehicle 
may be of viral or non- viral origin (see generally, JoUy, Cancer Gene Therapy 
(1994) 1:51-64; Kimtira, Human Gene Therapy (1994) 5:845-852; Connelly, 
Human Gene Therapy (1995) 1:185-193; and KapKtt, Nature Genetics (1994) 
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6:148-153). Gene therapy vehicles for dehvery of constructs including a coding 
sequence of a therapeutic of the invention can be administered either locally or 
systemically. These constructs can utilize viral or non-viral vector approaches. 
Expression of such coding sequences can be induced using endogenous 
5 mammahan or heterologous promoters. Expression of the coding sequence can 
be either constitutive or regtJated. 

The present invention can employ recombinant retroviruses which are 
constructed to carry or express a selected nucleic acid molecule of interest. 
Retrovirus vectors that can be employed include those described in EP 0 415 
10 731; WO 94/03622; WO 93/25698; WO 93/25234; U.S. Pat. No. 5, 219,740; and 
EP 0 345 242. Preferred recombinant retroviruses include those described in 
WO 91/02805. 

Packaging cell lines suitable for use with the above-described retroviral 
vector constructs may be readily prepared (see PCT publications WO 95/30763 

15 and WO 92/05266), and used to create producer cell hnes (also termed vector 
cell lines) for the production of recombinant vector particles. Within 
particularly preferred embodiments of the invention, packaging ceU lines are 
made from human (such as HT1080 cells) or mink parent cell lines, thereby 
allowing production of recombinant retroviruses that can survive inactivation 

20 in human serum. 

The present invention also employs alphavirus-based vectors that can 
function as gene delivery vehicles. Such vectors can be constructed from a wide 
variety of alphaviruses, including, for example, Sindbis virus vectors, SemUki 
forest virus (ATCC VR.67; ATGC VR-1247), Ross River virus (ATCC VR-373; 

25 ATCC VR-1246) and Venezuelan equine encephalitis virus (ATCC VR-923; 
ATCC VR-1250; ATCC VR 1249; ATCC VR-532). Representative examples of 
such vector systems include those described in U.S. Pat. Nos. 5,091,309; 
5,217.879; and 5,185,440; and PCT Publication Nos. WO 92/10578; WO 
94/21792; WO 95/27069; WO 95/27044; and WO 95/07994. 
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Gene delivery vehicles of the present invention can also employ 
parvovirus such as adeno-associated virus (AAV) vectors. Representative 
examples include the AAV vectors disclosed by Srivastava in WO 93/09239, 
Samulski et al., J. Vir. (1989) 63:3822-3828; Mendelson et al., ViroL (1988) 
5 166:154-165; and Flotte et al., PNAS (1993) 90:10613-10617. ' 

Representative examples of adenoviral vectors include those described 
by Berkner, Biotechniques (1988) 6:616-627; Rosenfeld et al,. Science (1991) 
252:431-434; WO 93/19191; KoUs et al., PNAS (1994) 91:215-219; Kass-Eisler 
et al., PNAS (1993) 90:11498-11502; Guzman et al.. Circulation (1993) 

10 88:2838-2848; Guzman et al., Cir. Res. (1993) 73:1202-1207; Zabner et al., CeU 
(1993) 75:207-216; Li et al., Hum. Gene Ther. (1993) 4:403-409; Cailaud et al., 
Eur. J. Neurosci. (1993) 5:1287-1291; Vincent et al., Nat. Genet. (1993) 5:130- 
134; Jaffa et al., Nat. Genet. (1992) 1:372-378; and Levrero et al.. Gene (1991) 
101:195-202. Exemplary adenoviral gene therapy vectors employable in this 

15 invention also include those described in WO 94/12649, WO 93/03769; WO 
93/19191; WO 94/28938; WO 95/11984 and WO 95/00655. Administration of 
DNA linked to killed adenovirus as described in Curiel, Hum. Gene Ther. 
(1992) 3:147-154 may be employed. 

Other gene delivery vehicles and methods may be employed, including 

20 polycationic condensed DNA linked or imlinked to killed adenovirus alone, for 
example Curiel, Hum. Gene Ther. (1992) 3:147-154; ligand linked DNA, for 
example see Wu, J, Biol. Chem. (1989) 264:16985-16987; eukaryotic ceU 
delivery vehicles cells, for example see U.S. Ser. No. 08/240,030, filed May 9, 
1994, and U.S. Ser. No. 08/404,796; deposition of photopolymerized hydrogel 

25 materials; hand-held gene transfer particle gim, as described in U.S. Pat. No. 
5,149,655; ionizing radiation as described in U.S. Pat. No. 5,206,162 and in 
WO92/11033; nucleic charge neutralization or fusion with cell membranes. 
Additional approaches are described in Philip, Mol. Cell Biol. (1994) 14:2411- 
2418, and in Woffendin, Proc. Natl. Acad. Sci. (1994) 91:1581-1585. 
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Naked DNA may also be employed. Exemplary naked DNA introduction 
methods are described in WO 90/11092 and U.S. Pat. No. 5,580,859. 

Further non-viral delivery suitable for use includes mechanical delivery 
systems such as the approach described in Woffendin et al,, Proc. Natl. Acad. 
5 Sci. USA (1994) 91(24): 11581-11585. 

The surprisingly intricate relationship between the gene sequences as 
disclosed herein and the development of malignancy as imraveled by the 
inventors signifies the presence of novel routes of cancer development. 

Due to this finding the identification of additional risk factors, novel 
10 therapeutic interventions and pharmaceuticals and the treatment and 
prophylaxis of cancers, and in particular of leukemia has now become 
available, which will ultimately result in a reduction in occurrence and/or 
progression of this disease as well as improved metrhods for diagnosis and 
prognosis. 

15 The present invention now also provides for a method of treatment 

comprising modulating the expression of gene sequences as described herein. 

A further object of the present invention is to provide for coordinated 
design and discovery of new drugs for the treatment of cancers, and in 
particular of leukaemia, and related diseases as well as providing compositions 

20 comprising modulators of such genes or their expression products which can 
serve as a basis or an ingredient of a phsurmaceutical composition. The present 
invention therefore also relates to pharmaceutical products that comprise such 
modulating compositions. As a farther object, the present invention provides 
the use of at least one modulator for the manufacture of a medicament for the 

25 treatment and/or prevention of cancers, and in particular of leukemia, and/or 
related disease. 

The present invention will now be illustrated by way of the following 
non limiting examples. 



30 



EKAMPLE 1 
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Identij5.cation of common viral insertion (CIS) 

INTRODUCTION 

To identify common viral integration sites in moiise txunors and tximor 
5 derived ceU lines, the MnLV virus was used. Mice were either infected with 
murine leukemia virus 1.4 (Graffi-1.4 MuLV) or with Cas-Br-M MuLV. The 
Graffi-MuLV is an ecotropic retroviral complex causing leukemias in mice. 
This viral complex does not contain oncogenic sequences itself but rather 
deregulates genes due to proviral integrations. Graffi-1.4 MuLV is a subclone 
10 of this complex and predominantly induces myeloid lexikemias. NIH/Swiss 

mice infected with Cas-Br-M MuLV develop myeloid or lymphoid malignancies 
also as a result of retroviral insertion that affect target genes. 

MATERIALS AND METHODS 

15 1. Induction of leukemias 

Newborn FVB/N mice or NIH/Swiss were injected subcutaneously with 
100 \il of a ceU c\ilture supernatant of Graf&-1.4 MuLV or Cas-Br-M MuLV 
producing NIH3T3 cells, respectively. Mice were checked daily for symptoms of 
illness, i.e., apathy, white ears and tail, impaired interaction with cage-mates, 

20 weight-loss, and dull fur. T5^ically, leukemic mice suffered from enlarged 
spleens, hvers, thymuses, and lymph nodes. From these primary tumors, 
chromosomal DNA was isolated for PCR-based screening. Blood samples were 
taken from the heart. For morphological analysis, blood smears and C3^spms 
were fixed in methsoiol, May-Griinwald-Giemsa (MGG) stained and analyzed. 

25 Single-cell suspensions of different organs were analyzed by flow 

c3rtometry using a flow cytometer. The cells were labeled with the following rat 
monoclonal antibodies: ER-MP54 (ER-MP54), ER-MP58 (ER-MP58), Ml/70 
(Mac-1), F4/80 (F4/80), RB68C5 (GR-1), ER.MP21 (transferrin receptor), 
TER119 (Glycophorin A), 59-AD2.2 (Thy-1), KT3 (CDS), RA3 6B2 (B220) and 
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E13 161-7(Scal). Immunodetection was performed utilizing a Goat-anti-Rat 
antibody coupled to fluorescein isothiocyanate (G_Ra-Fitc) 

2. Inverse PGR on Graffi-1.4 MuLV induced leukemias 

5 Genomic DNA from the primary tumors was digested with Hhal. After 

ligation, a first PGR was performed using Graffi-1.4 MuLV (LTR) specific 
primers LI (5' TGCAAGATGGCGTTACTGTAGGTAG 3') (SEQ ID NO: 1) and 
L2 (5' CCAGGTTGCCCCAAAGAGCTG 3') (SEQ ID NO: 2) (cycUng conditions 
were 1 min at 94°C, 1 min at 65°C, 3 min at 72°G [30 cycles]). For the second 

10 nested PGR the primers LIN (5' AGGGTTATGGTGGGGTGTTTG 3') (SEQ ID 
NO: 3) and L2N (5' AAAGAGGTGAAAGGAGCTTGC 3') (SEQ ID NO: 4) (15 
cycles) were used. The PGR reaction mixture contained 10 roM Tris-HGl (pH 
8.3), 50 mM KGL, 1.5 mM MgCk, 200 p.M dNTFs, 10 pmol of each primer and 
2.5. units of Taq-polymerase. The PGR firagments were analyzed on a 1% 

15 agarose gel and cloned in a TA cloning vector according to standard 

procedures. PGR products were sequenced with the M13 forward primer 5* 
GAGCGGGAGGAAAATG 3' (SEQ ID NO: 5) and M13 reverse primer 5' 
GAGGAAAGAGGTATGAG 3» (SEQ ID NO: 6). Virus flanking genomic 
sequences were identified using the National Genter for Biotechnology 

20 Information (NGBI) and Gelera databases 

3. Inverse PGR and real-time PGR on Cas-Br-M MuLV leukemias 

Five p-g of genomic DNA was digested with Sau3A or with SstL The 
products were treated with T4-ligase, which resulted in the formation of 

25 circularized products. Subsequently, an inverse PGR (ICPR) strategy was used 
with primers specific for the Cas-Br-M MuLV LTR. For the SauSA 
digested/Ugated firagments, the first PGR reaction was carried out with 
primers pLTR4 (5'-CCG AAA GTG GTT ACG ACA- 3') (SEQ ID NO: 7) and 
pLTR3 (5'-GGT GTC GTG AGA GTG ATT-3') (SEQ ID NO: 8), foUowed by a 

30 nested PGR using pLTR5 (5'-AGC AGA GAT ATG GTG TTT.30 (SEQ ID NO: 9) 
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and pLTR6 (5'-GTG ATT GAG TAG CCG GTT-3') (SEQ ID NO: 10). Cycle 
conditions for both PCRs were 15" at 94^*0, 30" at 57°C, and 2' at 72^0 for 10 
cycles, and an additional 20 cycles following the conditions 15** at 94°C, 30" at 
57°C, and 2'30" at 72^0. Reactions were performed using Expand High Fidelity 
5 PGR System. In case of SstI digested genomic DNA, the circularized DNA was 
amplified using primers pLTR9 (5'-GAC TCA GTG TAT GGG AGG AC-3') 
(SEQ ID NO: 11) and pLTRl (5'-CTT GGT GTT GCA TGC GAG TGG-3') (SEQ 
ID NO: 12), and pLTRlO (5'-GTG AGG GGT TGT GTG CTC-30 (SEQ ID NO: 
13) and pLTR2 (5'-GTC TCG GTG TTC GTT GGG AGG-3*) (SEQ ID NO: 14), 

10 respectively. The first PGR was performed for 30 cycles 30" at 94^C, 1' at 60°C, 
and 3' at 72°G. The reaction was carried out with Expand High Fidelity PGR 
system. Nested PGR conditions were 30 cycles of 30" at 94°C, V at SS'^C, and V 
at 72°G. This reaction was performed with Taq polymerase. 

. For RT-PGR, total RNA was isolated through a CsCl gradient. First 

15 strand cDNA was obtained by reverse transcriptase (RT) reactions with an 

oUgo(dT)-adapter primer (5'-GTG GGG AAT TGG TGG AGG GG(dT)i6-3') (SEQ 
ID NO: 15) at 37**C with 5p.g KNA, using the Superscript™ PreamplifLcation 
System. Subsequently, PCRs (1' at 94°C, 1* at 58*'C, 3' at 72^0 (30 cycles)) were 
performed on the RT reactions of the leukemias by using the LTR specific 

20 primer pLTR6 and the adapter primer (5'-GTC GGG AAT TCG TCG ACG CG- 
3') (SEQ ID NO: 15). PGR products were directly cloned into pCR2.1 and 
subjected to nucleotide sequencing with the M13 forward primer 
5'GACCGGCAGCAAAATG 3' (SEQ ID NO: 5) and ml3 reverse primer 
5'CAGGAAACAGGTATGAC 3' (SEQ ID NO: 6). Nucleotide sequences were 

25 compared to the NCBI and Celera databases for analysis. 

Results 

1. Gra£fi-1.4 MuLV induced leukemias 

Leukemias developed 4 to 6 months after subcutaneous injection of 
30 newborn FVB/N mice with Grafa-1.4 MuLV. Forty-eight of the 59 leukemias 
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(81%) analyzed exhibited morphological characteristics of myeloid cells. Blast 
cell percentages in the bone marrow ranged from 24 to 90% with an average of 
48%. Leukemia cells expressed immunophenotypic marker profiles consistent 
with their myeloid appearance, e.g., ER-MP54+, ER-MP58+, CD3-, GR-1+. Six 
5 leukemias with blast-Hke morphology showed no immunophenotypic 
differentiation markers, suggesting that these tumors represented very 
immature leukemias. Only 3 leukemias were of T-lynaphoid origin 
(CD3+/MP58-/Thyl-f) and 2 showed mixed myeloid and erythroid features 
(Terll9+/ER-MP58+/F4/80+). These results demonstrate that Graffi-1.4MuLV 

10 infection predominantly induce myeloid leukemia in FVB/N mice. 

The provirus flanks were cloned, subjected to nucleotide sequencing, 
and blasted against the Celera and NCBI databases resulting in the 
identification of common insertion sites. Of the genes identified, the ones that 
were so far not described to be involved in tumor development are listed in 

15 Table 1 combined with the novel cancer genes identified from Cas-Br-M MuLV 
induced letikemias . 

2. Cas-Br-M MuLV induced leukemias 

Cas-Br-M MuLV injected newborn NIH/Swiss mice developed leukemias 
20 by approximately 140 to 400 days postinoculation. Most of these were myeloid 

leukemias (59%), although T-cell (21%), and mixed T-cell/myeloid (21%) 

leukemias were found. 

To clone viral integration sites, a virus-LTR (long terminal repeat) 

specific inverse PGR as well as RT-PCR were applied as complementary 
25 approaches using DNA or RNA fi:om 35 myeloid leukemias. The inverse PGR 

method was carried out on 19 primary leukemias and 9 cell lines, whereas the 

RT-PGR based technique was performed on 12 cell lines and 2 primary 

leukemias. 

The provirus flanks were subjected to nucleotide sequencing, blasted 
30 against the Celera and NCBI databases res\dting in the identification of 
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common insertion sites. Of the genes identified, the ones that were so far not 
described to be involved in timior development are listed in Table 1 combined 
with the novel cancer genes identij&ed from Graffi-1.4 MuLV induced 
leukemias. 

5 

DISCUSSION 

Although proviral integrations occur randomly, they may affect the 
expression or function of nearby genes. If a gene is affected in two or more 
independent tumors, this indicates that these integrations provide a selective 

10 advantage and therefore contribute to tumor development. Multiple of these 
common insertion sites were identified of which a large number are 
demonstrated for the first time to play a role in cancer. Importantly, several of 
the other genes identified sire well-known cancer genes validating the 
approach. This example shows that the pm-sued strategy can be successfully 

15 used to identify novel genes that are involved in tumor development. 

EXAMPLE 2 

Definition of novel routes for development of AML 

MATERIAL AND METHODS 
20 6raffi-l,4 MuLV-induced leukemias 

Newborn mice were injected subcutaneously with 100 id of cell culture 

supernatant of Graffil.4-MuLV producing NIH3T3 cells (a gift firom Dr. E. 

Rassart, Department des Siences Biologiques, Universite du Quebec a 

Montreal, Montreal, Quebec, Canada). Mice were treated and analyzed for the 
25 development of leukemia as previously described (Erkeland, et ah, 2003, Blood 

101:1111-7). Chromosomal DNA was isolated from the leukemic cells for PCR- 

based screening (Erkeland, et al, 2003, Blood 101:1111-7). 



Cytological analysis and immunophenotyping of the leukemic cells 
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For morphological analysis, blood smears and cytospins were fixed in 
methanol, stained with May-Griinwald-Giemsa, and examined using a Zeiss 
Axioscope microscope (Carl Zeiss BV, Weesp, The Netherlands). Single-cell 
suspensions of different organs were analyzed by flow cytometiy using a 
5 FACScan flow cytometer (Becton Dickinson and Co, Mo:unt»ia. View, CA, 

USA). Cells were labeled as described previoiisly (Joosten et al., 2000. Virology 
268:308-18) with the following rat monoclonal antibodies: ER-MP54, ER- 
MP58, Ml/70 (Mac-1), F4/80, RB68C5 (GR-l), ER-MP21 (transferrin receptor), 
TER119 (Glycophorin A), 59-AD2.2 (Thy-1), KT3 (CDS), RA3 6B2 (B220) and 
10 E13 161-7 (Seal). Immunodetection was performed using goat-anti-rat 

antibodies coupled to fluorescein isothiocyanate (GotRa-FITC) (Nordic, Tilburg, 
The Netherlands). 

Inverse PGR on Graffi-1.4 MuLV induced leukemias 
15 Genomic DNA from the primary tumors was digested with Hhal 

(CGCG). After circularization by ligation (Rapid ligation kit, Roche 
Diagnostics, Maimheim, Germany), a first PGR was performed using Graffi-1.4 
MuLV (LTR) specific primers LI and L2 as described in Example 1. Cycling 
conditions were 1 min at 94*=*C, 1 min at 62**C and 3 min at 72*^0 for 30 cycles. 
20 For the second nested PGR, the primers LIN and L2N as described in Example 
1 (15 cycles) were used. The PGR reaction mixture contained 10 mM Tris-HGl 
(pH 8.3), 50 mM KCL, 1.5 mM MgCh, 200 mM dNTFs, 10 pmol of each primer 
and 2.5 units of Taq-polymerase (Pharmacia, Uppsala, Sweden). The PGR. 
firagments were analyzed on a 1% agarose gel. 

25 

Detection of virus integrations by specific nested PGR 

To determine the localization of the Graffi-1.4 provirus in particular 
virus-targeted genes in an extended panel of leukemias, a nested PGR was 
performed on DNA from primary timiors. For the first PGR, virus integration 
30 site (VIS) locus specific primers XI and X2 were used in combination with 
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Gra£&-1.4 MuLV LTR specific primers Ll and L2 (see also Figure 1). Cycling 
conditions were 1 min 94°C, 1 min 62 °C, 3 min 72°C for 30 cycles. For the 
second PGR, nested VIS-specific primers XIN and X2N (i.e. genomic region 
specific primers) were used in combination with nested LTR specific primers 
5 LIN and L2N xmder the same conditions. The obtained PGR products were 
analyzed by Southern blotting. To verify the correct nature of the amplified 
bands, the blots were hybridized with radiolabeled gene specific probes PI and 
P2 at 45°G in Ghurch buffer (0.5 M phosphate buffer, pH 7.2, 7% (w/v) SDS, 10 
mM EDTA) overnight. Signals were visuahzed by autoradiography according 
10 to standard procedures (Maniatis ef aZ., 1982, Molecular Cloning, A Laboratory 
Manual. Cold Spring Harbor Laboratory, Cold Spring Harbor, New York). 

Nucleotide sequence analysis 

PGR products were sequenced using an ABI 3100 sequencer (Applied 
15 Biosystems, Nieuwerkerk a/d IJssel, The Netherlands) with the GrafFi-1.4 
MuLV specific forward primer Ll. Virus flanking genomic sequences were 
analyzed using GenBank (National Center for Biotechnology Information), 
Celera Discovery System (Gelera Genomics, Rockville, MD, USA) (Hogenesch 
et al., 2001, Cell 106:413-5; Kerlavage et ah, 2002, Nucleic Acids Res. 30:129- 
20 36) and Ensembl (Wellcome Trust Genome Campus, Hinxton, 

Cambridgeshire, UK) (Hubbard et al, 2002. Nucleic Acids Res. 30:38-41). 

Results 

GrafXi-1.4-induced leukemia 

25 Eighty-nine newborn FVB/N mice were inoculated with Graf5-1.4. When 

moribund, mice were sacrificed and hematopoietic organs were isolated. Six 
mice died without signs of leukemia and were excluded &om further 
investigation. Standard blood cell analysis was performed and values 
compared with the mean of 10 normal FVB/N mice (Joosten et al, 2002, Exp. 

30 Hematol. 30:142-9), Most of the leukemic mice had increased nxmibers of 
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peripheral white blood cells and decreased numbers of platelets and red blood 
cells compared to normal controls (data not shown). Blast percentages in the 
bone marrow ranged from 24 to 90% with an average of 48%. Leukemic cells 
from 76 mice were immunophenotyped. The major immunophenotypic features 
5 of these leukemias are given in Table 2. 



Table! 





Leukemia type 


Immunophenotype ' 


No. of 
leukemias 


I 


T-lymphoid 
Markers 


MP21^ CD3*, Thyl* 


2 


n 


Mixed lymphoid, 
erythroid, myeloid 
differentiation 
markers 


Grl"", F4/80*, Macl% Imm*(a), CD3*, 
B220*, (gcsfryb), (TerllP"^ 


12 


m 


Myeloid 
differentiation 
Markers 


Imm\ MP21*, («^/80*,Grl^ B220^ Mac- 


43 


IV 


myelo-monocytic 
blasts 


Imm", Grr, gcsfr, (F4/800,(Macl+), 
(B220*) 


15 


V 


Erythroid 


Terlir, MP2r (Seal") 


4 


Total 






76 



aCImm*^) indicates positive staining for immature hematopoietic cell markers 



Scal+, MP58+, MP54+, (Thyl+). ^Markers between brackets are not 
10 consistently expressed on all individual tumors. 

Based on these criteria, the leukemias were classiEied as T-lymphoid, 
mixed lymphoid/erythroid/myeloid, myeloid, myelo-monocjrtic or erythroid. 
Fifty-nine leukemias were analy^d morphologically. Almost all of the mice 
15 showed splenomegaly of which 25% thymus enlargement, 20% with lymph 

node enlargement and 55% Uver involvement. In 5% of the mice, leukemic ceUs 
were present in the BM and blood, without overt peripheral organ 
involvement. 

20 Virus integration sites in Gr-1.4 induced leukemias 
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PGR analysis in conjunction with database searches identified a total of 
94 different virus integration sites out of 69 tumors, 38 of which had been 
found previously in other studies. Examples of this latter group of CIS are p53, 
Notch-1, Evi-1, NFl (Evi-2), Lck-1, Pim-1, HoxA9 (Evi-6), Fli-1 and N-Myc. 
5 Notably, 79 of the 94 integrations are CIS directly implicating the affected 
genes in leukemic transformation. The remaining 15 have been included in 
this report because closely related family members of these genes were found 
in other studies. Fifty-six of the identified sequences were mapped near or in 
novel candidate leukemia genes. The products of the affected genes have been 

10 classified in the following categories: receptors and signaling molecules (Table 
3), regulators of transcription (Table 4) and the remaining group of gene 
products with regulatory roles in other pathways (e.g., DNA stability and 
proteasomal targeting) or with currently unknown functions (Table 5). 

Approximately 15% of the virus integrations identified in this inverse 

15 , PCR-based screen were determined to be common (more than 2 integrations in 
a particular genomic locus) in the initial screen. To determine the frequency of 
common integrations more sensitively, we performed integration specific PGR 
reactions on 49 Gr-1.4 target genes (tables 3-5) as described elsewhere 
(Erkeland, et aL,. 2003, Blood 101:1111-7; Joosten et al, 2002, Oncogene 

20 21:7247-55) of which most (96%) appeared to be common. The genes flanking 
the virus integration sites were compared with data firom the National Cancer 
Institute retroviral tagged cancer gene database (RTCGD) at 
http://genome2.ncifcrf.gov and other sources (Table 3). The Gr-1.4 integrations 
that are present in this database or were reported in other studies have been 

25 marked with a reference in Tables 3, 4 and 5. Family members of Gr-1.4 
targeted genes found in other studies are listed in the column "identified 
family member". Most of the virus integrations occurred in or near 5' or 3' ends 
of the genes, suggesting that expression levels of these genes are deregulated 
as a consequence of virus integration. Eighteen CIS were exclusively located 

30 within the gene, conceivably causing gene disruption. This group comprises the 
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previously reported virus targets Notch-1, NFl, PTPNl, Inpp4a, p53, SWAP70 
and Kcnk5 and 11 new genes: calcyphosine, phosphodiesterase -1, ELL, NCOR- 
1, HDAC-7A, histone H3.3A, api-5, RIL, D6Mni5e and two imknown genes. 

5 Discussion 

In this study, we have employed in vivo retroviral mutagenesis using 
the Grafifi-1.4 (Gr-1.4) virus complex to identify novel routes for the 
pathogenesis of acute myeloid leukemia (AML) by identifying the disease 
genes specifically involved therein. In comparable studies with other virus 

10 strains, e.g., Moloney, AKXD or Cas-Br-M, the most prominently appearing 
tumor types are T and B cell lymphomas and in case of BXH2 myelomonocytic 
tumors. In contrast, more than 80% of the leukemias induced by Gr-1.4 
xmequivocally exhibited immunophenotypic characteristics of myeloid cells, 
with immature morphological features (mainly myeloblasts), emphasizing the 

15 imique features of the Gr-1.4 virus complex as a tool for characterizing 
pathogenetic mechanisms in myeloid disease (see table 2). 

Indeed, the majoriiy of the CIS described here have thus far not been 
reported in the extensive screens in the lymphoma models, although some 
overlap was observed. The latter is not sxirprising in view of the fact that cell 

20 type specific events usually impinge on downstream common regulatory 
pathways that can be affected in mxiltiple tumor types. 

Although most of the Gr-1.4 CIS have been Unked to candidate disease 
genes based on their proximity to these genes, proviral integrations have been 
reported to influence gene expression over distances of more than 100 kb, 

25 Thus, it cannot be excluded that genes located more distantly firom the CIS are 
also deregulated and that multiple genes are affected by a single CIS. In 
addition, three other aspects of the retroviral screens as they are currently 
being performed must be emphasized. First, malignancies induced by 
replication competent retroviruses are usually oligo- or polyclonal rather than 

30 monoclonal. Although this gives rise to high frequencies of CIS in relatively 
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small cohorts of mice, it complicates the search for cooperating events within 
one leukemic clone. Second, the sensitive PCR-based techniques used to 
identify virus integrations do not allow distinction between CIS present in a 
majority of the leukemic cells, initiating an early pathogenic event, and CIS 

5 present in only a minor population of cells, probably affecting leukemia 
progression genes. Finally, a recent study has emphasized that murine 
leukemia viruses have a preference for integration near transcriptional start 
sites (Wu, et al., 2003, Science 300:1749-51). Our present data with Gr-1.4 
corroborate this conclusion and indicate that retroviral mutagenesis with 

0 MuLV preferentially, although not exclusively, identifies gene deregulation 
rather than gene disruption. 

In conclusion, using high-throughput retroviral screens with the Gr- 
1.4 virus complex we have identified novel pathways involved in myeloid 
leukemia. Currently, we are studying the consequences . of aberrant gene 

5 expression or function employing gene transfer methodology in 32D cells and 
primary bone marrow cxiltures. 

Conclusion 

Acute myeloid leukemia (AML) is a heterogeneous group of diseases, 
0 in which chromosomal aberrations, small insertions/deletions, or point 
mutations in certain genes have profound consequences for prognosis. 
However, the majority of AML patients present without currently known 
genetic defects. Retroviral insertion mutagenesis in mice has become a 
powerful tool to identify new disease genes involved in the pathogenesis of 
5 leukemia and lymphoma. Here we have used the Graf&-1.4 murine leukemia 
virus (Gr-1.4) strain, which causes predominantly acute myeloid lexikemias, in 
a screen to identify novel genes involved in the pathogenesis of this disease. 
We report 79 candidate disease genes in common virus integration sites (CIS) 
and 15 genes of which family members were previously foxmd to be affected in 
0 other studies. The majority of the identified sequences (60%) have not been 



wo 2004/016317 



PCT/NL2003/OOOS83 



73 

foxmd in previous screens in l3nnphomas and monocytic letikemias, suggesting 
a specific involvement in AML. Although most of the virus integrations 
occurred in or near the 5* or 3* ends of the genes, suggesting deregulation of 
gene expression as a consequence of virus iategration, 18 CIS were exclusively 
5 located within the gene, conceivably causing gene disruption. 

EXAMPLE 3 

Large scale identification of novel potential disease loci in mouse leukemia 
MATERIAL AND METHODS 
Cas-Br-M MuLV-induced leukemias 

The NIH-Swiss derived Cas-Br-M MuLV-induced primary tumors used 
in this study are: for IPCR: CSL 13, 16, 20, 22, 26, 30, 31, 32, 33, 65, 71, 78, 82, 
90, 91, 93, 111, 117, 123, 201, 203, 204, 212, 221, 227, 228, 237 and 239; and 
for RT-PCR: CSL: 201 and 203 (Joosten et aL 2000. Virology 268:308-18). Cells 
isolated from leukemic spleens were stored vitally in ahquots in liquid 
nitrogen. The following leukemia cell Hnes were utihzed in this study: for 
IPCR: DA 24, and NFS 22, 36, 56, 58, 60, 61, 78, and 124; and for RT-PCR: DA 
1, 2, 3, 8, 33, and NFS 22, 36, 58, 61, 78, 107 and 124 (Valk et al, 1997 Blood 
90:1448-1457). These cell lines were cxdtured in RPMI plus 10% fetal calves' 
serum (FCS) (Gibco Life Technologies Inc., Gent, Belgium) and 10 Units of 
mouse IL3. The frequency of virus integrations was determined both on cell 
lines, and primary leukemias. 

IPCR and RT-PCR 

Isolation of genomic DNA was carried out exactly as described 
previously (Valk et al, 1999 J. Virol. 73:3595-602). Five pg of genomic DNA 
was digested with Sau3A, Pvull or Sstl (GIBCO Life Technologies Inc., Gtent, 
Belgixun). The products were treated with T4-hgase (GIBCO Life Technologies 
Inc.), which resulted in the formation of circularized products. Subsequently 
we performed an IPCR strategy using primers specific for the Cas-Br-M MuLV 
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LTR. For the SauSA digested/ligated fragments, the first PGR reaction was 
carried out with primers pLTR4 (5'-CCGAAACTGCTTACCACA-3') (SEQ ID 
NO. 7) and pLTR3 (5'-GGTCTCCTCAGAGTGATT-30 (SEQ ID NO. 8), foUowed 
by a nested PGR using pLTR5 (5*-ACCACAGATATCGTGTTT-3') (SEQ ID NO. 
5 9) and pLTR6 (5*-GTGATTGACTAGCCGCTT-3') (SEQ ID NO. 10). Cycle 

conditions for both PGRs were 15 s at 94°C, 30 s at ST'^^G, and 2 min at 72°G for 
10 cycles, and an additional 20 cycles following the conditions 15 s at 94*^0, 30 
s at 57°G, and 2 min 30 s at 72°C. Reactions were performed using Expand 
High Fidelity PGR System (Roche, Mannheim, Germany). For PvuII and SstI 

10 digested genomic DNA, the circularized DNA was amplified using primers 

pLTR7 (5'-GAGTCAGTCTATCGGAGGAG-3') (SEQ ID NO. 11) and pLTRI (5'- 
GTTGCTGTTGGATGGGAGTGG^3') (SEQ ID NO. 12), and pLTR8 (5'- 
GTGAGGGGTTGTGTGGTC-3') (SEQ ID NO. 13) and pLTR2 (5»- 
GTGTCGGTGTTCGTTGGGAGG-30 (SEQ ID NO. 14) respectively. 

15 The first PGR was performed for 30 cycles 30 s at 94°G, 1 min at 54°G, 

and 3 min at 72°G. Reactions were carried out with Expand High FideUty PGR 
system (Roche). Nested PGR conditions were 30 cycles of 30 s at 94^0, 1 min at 
58**G, and 1 min at 72°G. This reaction was performed with Taq pol3rmerase 
(Amersham Pharmacia Biotech, Roosendaal, The Netherlands). RT-PCR was 

20 carried out as described previously by Valk et al. (1997c). Briefly, total RNA 
was isolated through a CsGl gradient. First strand cDNA was obtained by 
reverse transcriptase (RT) reactions with an oligo(dT)-adapter primer (5'- 
GTCGCGAATTCGTCGACGCG(dT)i5-30 (SEQ ID NO. 15) at 37^C with 5 fig 
RNA, using the Superscript M Preamphfication System (Life Technologies, 

25 Breda, The Netherlands) according to the instructions of the manufacturer. 
Subsequently, PGRs (1 min at 94''G, 1 min at 58**C, 3 min at 72^*0 (30 cycles)) 
were performed on the RT reactions of the leukemias by using the LTR specific 
primer pLTR6 and the adapter primer (5'-GTCGGGAATTGGTGGACGGG-3') 
(SEQ ID NO. 15). PGR products were directly cloned into pCR2.1 (Invitrogen, 

30 Breda, The Netherlands) according to the instructions of the manufacturer and 
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subjected to nucleotide sequencing. Nucleotide sequences were compared to the 
NCBI database for analysis. 

PGR analysis and Southern blot hybridization 
5 One ug of genomic DNA isolated from primary leukemias or cell lines 

was subjected to PGR with primer pLTEl and a locus specific primer A (Figure 
2). Locus specific primers of 17-21 nucleotides were designed specific for each 
of the sequences of the fragments generated by Inverse- or RT-PGR. Primers 
were piurchased from Life Technologies. One \il of PGR product was transferred 
to a nested PGR reaction using primer pLTR2 and a nested locus specific 
primer B. Cycle conditions for both reactions were 1 cycle 5 min at 94°C, 30 
cycles 30 s at 94*'G, 1 min Tm, 1 min 30 s at 72^G, 1 cycle 5 min at 72**C. Tm 
was specific for each primer and was between 48°G and 62°G. PGR was carried 
out using Taq polymerase (Amersham Pharmacia Biotech). PGR products were 
electrophoresed in a 1.5% agarose gel and transferred to Hybond-N**^ nylon 
membranes (Amersham Pharmacia Biotech) with 0.25 M NaOH/1.5 M NaGL 
Membranes were hybridized with a 32p-end-labeled locus specific primer G. 
Labehng was carried out using T4kinase (USB, Gleveland, OH, USA) according 
to the instructions of the provider. Subsequently, blots were stripped in 0.4 M 
NaOH for 30 min at 45**C, neutralized using 0.2 M Tris-HGl, 0.2% SDS, and 
0,1 X SSC for 15 min at 45°C and hybridized with s^P-end-labeled Cas-Br-M 
MuLV specific primer pLTRS. Blots were exposed for autoradiography with a 
KODAK film and an intensifying screen. After 15 min of exposure films were 
developed and analysed. 

Sequencing analysis 

Samples were prepared using the Bd-sequencing kit according to 
instructions from the provider (PE Biosystems, Nieuwerkerk a/d IJssel, The 
Netherlands) and nucleotide sequencing was carried out on an ABI 310 
automatic sequencer (PE Biosystems) using primers M13 forward (5'- 
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GTAAAACGACGGCCAGT-30 (SEQ ID NO. 24) and M13 reverse (5*- 
GGAAACAGCTATGACCATG-30 (SEQ ID NO. 25). Sequences isolated by 
IPCR or RT-PCR were compared to the data present in the Celera Discovery 
System (Celera Genomics, Rockville, MD, USA, database contents May 2002) 
and in the GenBank of the National Center for Biotechnology Information 
(NCBI, Bethesda, MD, USA). The exact site of integration was determined. For 
insertions found outside a gene, the distance between the integration and the 
most nearby gene was calculated. The location on the mouse chromosome was 
established and the hxnnan equivalent was deduced by using the human 
databases of Celera Discovery System (May 2002), LocusLdnk or human mouse 
homology maps (both www.ncbi.nlm.nih.gov). 

Results 

Appljdng virus LTR-specific inverse -PCR and RT-PCR combined 
with automated sequencing on CasBR-M MuLV induced myeloid leukemias, 
126 virus integration sites were cloned. Using locus- and LTR-specific primers, 
a nested PCR/Southern-blotting procedure was developed on genomic DNA 
from a large panel of MuLV-induced leukemias, to analyse whether a 
particular virus insertion represented a common virus integration site (cVIS). 
In fact 39 out of 41 integrations analysed this way represented a common viral 
integration site. We recognized six previously cloned cVISs, i.e. Evil, HoxaT, 
cMyb, Cb2/Evill, Evil2, and Hisl and 33 novel common insertions. Among 
this group we found integrations in or near genes encoding nuclear proteins, 
e.g. Dnmt-2, Nm23-M2, Ctbpl or Erg, within receptor genes, e.g. Cb2 or mrcl, 
novel putative signaling or transporter genes, the ringfinger-protein gene 
Midi and a panel of genes encoding novel proteins with unknown function. 
The finding that 39 out of 41 integrations analysed represented a cVIS, 
suggests that the majority of the other virus insertions that were not yet 
analysed by the PCR/Southern-blotting method are located in e cVIS as well 
and may therefore also harbor novel disease genes. 
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Claims 

1. Use of at least one miuine genomic region involved in the development 
of cancer listed in Table 1 or its hxmian homologue for the preparation of 
polypeptide encoded by said region. 

2. Use of at least one mxirine genomic region listed in Table 1 or its human 
5 homologue for the preparation of an inhibitor capable of inhibiting the 

transcription product or activity of a polypeptide encoded by said region or 
affected by transformations in said region. 

3. Use according to claim 2, wherein said inhibitor is a small molecule 
interfering with the biological activity of the polypeptide encoded by said 

10 genomic region or with the biological activity of a polypeptide the expression of 
which is affected by transformations in said genomic region. 

4. Use according to claim 2, wherein said inhibitor is an antibody. 
* . 5. Use according to claim 2, wherein said inhibitor is an antisense 

molecule, in particular an antisense RNA or antisense oligodeoxynucleotide. 
15 6. Use according to claim 2, wherein said inhibitor is an RNAi molecule. 

7. Use according to claim 2, wherein said inhibitor is a ribozyme. 

8. Isolated nucleic acid sequence comprising at least one murine genomic 
listed in Table 1 or its human homologue. 

9. Isolated nucleic acid sequence according to claim 8, further comprising 
20 one or more regulatory sequences. 

10. Recombinant vector comprising the nucleic acid sequence of claim 8 or 9. 

11. Recombinant host cell comprising the vector of claim 10. 

12. Recombinant host cell according to claim 11, wherein said host cell is a 
eukaryotic host cell 

25 13. A eukaryotic host cell according to claim 12, wherein said eukaryotic 
host ceU is a mammalian stem cell. 
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14. A mamniaKan stem cell according to claim 13, wherein said mammalian 
stem cell is a hematopoietic stem cell. 

15. Inhibitor compound capable of inhibiting the transcription product or 
poljrpeptide encoded by at least one mxuine genomic region listed in Table 1 or 

5 its human homologue or capable of inhibiting the transcription product or 
polypeptide the expression of which is affected by a transformation in said 
genomic region. 

16. Inhibitor compoimd according to claim 15, wherein said inhibitor 
compoxmd is an antibody or derivative thereof directed against said 

10 pols^peptide. 

17. Inhibitor compound according to claim 16, wherein said polypeptide is 
expressed on the cell membrane. 

18. Inhibitor compound according to claim 16 or 17, wherein said derivative 
is selected from the group consisting of scFv fragments. Fab fragments, 

15 chimeric antibodies, bifunctional antibodies, intrabodies, and other antibody- 
derived molecules. 

19. Inhibitor compound according to claim 15, wherein said iahibitor 
compound is a small molecule interfering with the biological activity of said 
polypeptide. 

20 20. Inhibitor compound according to claim 15, wherein said inhibitor 
compound is an antisense molectile, in partictilar an antisense RNA or 
antisense oligodeoxynudeotide. 

21. Inhibitor compound according to claim 15, wherein said inhibitor 
compound is an RNAi molecule. 
25 22. Inhibitor compound according to claim 15, wherein said inhibitor 
compound is a ribozyme. 

23. Inhibitor compound according to any one of the claims 15-22, for use in 
the treatment of cancer. 

24. Inhibitor compound according to claim 23, wherein said cancer is 
30 leukemia, preferably acute myeloid leukemia (AML). 
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25. Use of an inhibitor compoxind according to any one of the claims 15-24, 
for the preparation of a pharmaceutical composition for the treatment of 
cancer. 

26. Use according to claim 25, wherein said cancer is leukemia, preferably 
5 acute myeloid leukemia (AML). 

27. Use according to claim 25 or 26, wherein the treatment comprises gene 
therapy. 

28. Use of an inhibitor compound according to any one of the claims 15-22, 
for the preparation of a pharmaceutical composition for treatment of 

10 inflammatory diseases. 

29.. Pharmaceutical composition for the treatment of cancer, comprising at 
least one inhibitor compound according to any one of the claims 15-24 and a 
smtable excipient, carrier or diluent. 

30. Method of treating a mammal, comprising administering to a mammal 
15 the pharmaceutical composition of claim 29 in an amoxmt effective to alleviate 

or prevent the formation of cancer, preferably in an amoimt effective to provide 
remission or prevention of relapse of sohd tumors. 

31. Method according to claim 30, wherein said cancer is leukemia, 
preferably acute myeloid leukemia. 

20 32. Method of treating a mammal, comprising administering to a mammal 
the hematopoietic stem cell of claim 14 in an amount effective to alleviate or 
prevent the formation of leukemia, preferably acute myeloid leukemia. 

33. Use of at least one murine genomic region listed in Table 1 or its human 
homologue or a transcription product thereof or polypeptide encoded thereby 

25 for the preparation of a diagnostic reagent for diagnosis of cancer. 

34. Use according to claim 33, wherein said cancer is leukemia, preferably 
acute myeloid leukemia (AML). 

35. A diagnostic reagent capable of specifically binding to a miurine genomic 
region listed in Table 1 or its human homologue or a transcription product 

30 thereof or polypeptide encoded thereby. 
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36. A diagnostic reagent, according to claim 35, wherein said diagnostic 
reagent is an antibody or a derivative thereof. 

37. A diagnostic reagent, according to claim 35 or 36, wherein said 
diagnostic reagent is a nucleic acid probe. 

5 38. A diagnostic composition comprising a diagnostic reagent according to 
any one of claims 35-37. 

39. Use of a diagnostic composition according to claim 38, for the diagnosis 
of cancer. 

40. Use according to claim 39, wherein said cancer is a solid tumor of lung, 
10 colon, breast, prostate, ovarian or pancreas cancer. 

41. Use according to claim 39, wherein said cancer is leiikemia, preferably 
acute myeloid leukemia (AML). 

42. Use according to any one of claims 39-41, wherein the diagnosis is 
performed by means of histological analysis of tissue specimens using specific 

15 antibodies directed against encoded pol3T)eptides, using in-situ hybridisation 
analysis of gene expression levels in tissue specimens with RNA probes 
directed against gene sequences or using polynucleotide or oligonucleotide 
arrays. 

43. A kit of parts for implementing a use of any one of claims 39-41, 
20 comprising diagnostic composition according to claim 38, and one or more 

reagents selected form the group consisting of reagents for the isolation of 
nucleic acid fragments £rom a sample, reagents for the isolation of polypeptides 
£rom a sample, reagents for immxmostaining of a sample, reagents for in situ 
hybridisation of a sample and reagents for performing nucleic acid array 
25 hybridisations. 

44. Method for the development of an inhibitor compound according to any 
one of the claims 15-22, comprising the steps of: 

a) identification of genes involved in cancer, in particular by using retroviral 
insertional tagging, optionally in a specific genetic backgroimd; 
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b) validation of one or more of the identified genes as potential target gene(s) 
for the inhibitor compound by one or more of the following methods: 

- confirmation of the identified gene by Northern Blot analysis in cancer cell- 

hnes; 

5 - determination of the expression profile of the identified gene in tumors and 
normal tissue; 

- determination of the functional importance of the identified genes for cancer; 

c) production of the expression product of the gene; and 

d) use of the expression product of the gene for the production or design of an 
10 inhibitor compound. 

45. Method as claimed in claim 44, wherein the gene identified in step a) is 
a murine genomic region listed in Table 1 or its human homologue. 

46. Method for the identification of genomic regions involved in the 
development of cancer comprising the steps of: 

15 a) performing retroviral insertional mutagenesis of a subject, comprising 

infecting said subject with a txmior inducing retrovirus; 

b) isolating chromosomal DNA firom tumors developed in the infected 
subject; 

c) digesting said chromosomal DNA with a restriction enz5ane capable of 
20 cutting at least once in the DNA sequence of said tumor inducing retrovirus 

and at least once in the chromosomal DNA of said subject; 

d) Ugating the digested DNA to circular DNA; 

e) amplifsnlng the chromosomal DNA fragment flanking the retroviral 
DNA sequence by performing a first PGR reaction with said circtdar DNA 

25 using a first set of retrovirus-specific primers and performing a second nested 
PGR with the product of said first PGR reaction using a second nested set of 
retrovirus-specific primers, and 

f) determining the nucleotide sequence of said chromosomal DNA 
fragment, and optionally comparing said nucleotide sequences with known 
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sequences in a database to yield the genomic region involved in the 
development of cancer. 

47. Method for the identification of common virus integration sites in the 
development of cancer comprising the steps of 

5 a) performing the method of claim 46; 

b) designing genomic region- specific amplification primers; 

c) isolating nucleic acids firom at least two tumors to be analysed for the 
presence of a common virus integration site; 

d) performing an amplification reaction with said nucleic acids usin^ a 
10 set of nested primers comprising genomic region-specific primers and 

retrovirus-specific primers, and 

e) blotting the amplification products and separately hybridizing the 
resulting blot with a retrovirus-specific probe and a genomic region-specific 
probe to determine the presence of common virus integration sites between 

15 said ttmiors. 

48. Set of genomic regions obtainable by a method according to claim 46 or 
47.^ 

49. Set of genomic regions according to claim 48, wherein said genomic 
regions are the murine genomic regions Usted in Table 1. 

20 50. Set of genomic regions according to claim 49, comprising at least 2, 

murine genomic regions selected firom the group consisting of Adamll, Akap7, 
Arpgapl4 , Bomb, Cd24a, Cish2, Cig5, CHc3. Cra, Dermol, EMILIN, Flj20489, 
Galnt5, Hook, Ier5, IL16, Iprgl, Itgp, Kcnk5, h:Tc2, Ltb, MbU, Mrcl, Mtap7, 
Ninj2, Nrldl, Pcdh9, Prdx2, Prpsl, Pdil, Ptgd3, Rgll, Sardh, Scya4, Slcl6A6, 

25 Swap70, Txnip, Trim46, Tnfi:sfl7 and Ubl3. 

51. Set of genomic regions according to claim 49, comprising at least 2, 
murine genomic regions selected firom the group consisting of Cd24a, Cish2, 
Cra, Ltb and Prdx2. 
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